Realm: An event-based low-level runtime for distributed memory architectures
S Treichler, M Bauer, A Aiken - … of the 23rd international conference on …, 2014 - dl.acm.org
We present Realm, an event-based runtime system for heterogeneous, distributed memory
machines. Realm is fully asynchronous: all runtime actions are non-blocking. Realm …
mpi4py.futures: MPI-based asynchronous task execution for Python
We present mpi4py.futures, a lightweight, asynchronous task execution framework targeting
the Python programming language and using the Message Passing Interface (MPI) for …
Streaming message interface: High-performance distributed memory programming on reconfigurable hardware
Distributed memory programming is the established paradigm used in high-performance
computing (HPC) systems, requiring explicit communication between nodes and devices …
RAMP: Research accelerator for multiple processors
The RAMP project's goal is to enable the intensive, multidisciplinary innovation that the
computing industry will need to tackle the problems of parallel processing. RAMP itself is an …
Hybrid MPI: efficient message passing for multi-core systems
A Friedley, G Bronevetsky, T Hoefler… - Proceedings of the …, 2013 - dl.acm.org
Multi-core shared memory architectures are ubiquitous in both High-Performance
Computing (HPC) and commodity systems because they provide an excellent trade-off …
Runtime mechanisms for efficient dynamic multithreading
V Karamcheti, J Plevyak, AA Chien - Journal of Parallel and Distributed …, 1996 - Elsevier
High performance on distributed memory machines for programming models with dynamic
thread creation and multithreading requires efficient thread management and …
[PS] MPI: a standard message passing interface
DW Walker, JJ Dongarra - Supercomputer, 1996 - users.cs.cf.ac.uk
MPI is a proposed standard message passing interface originally designed for writing
applications and libraries for distributed memory environments. The main advantages of …
A portable runtime interface for multi-level memory hierarchies
M Houston, JY Park, M Ren, T Knight… - Proceedings of the 13th …, 2008 - dl.acm.org
We present a platform independent runtime interface for moving data and computation
through parallel machines with multi-level memory hierarchies. We show that this interface …
[HTML] The design and evaluation of a shared object system for distributed memory machines
DJ Scales, MS Lam - Proceedings of the 1st USENIX conference on …, 1994 - usenix.org
This paper describes the design and evaluation of SAM, a shared object system for
distributed memory machines. SAM is a portable run-time system that provides a global …
memif: Towards Programming Heterogeneous Memory Asynchronously
To harness a heterogeneous memory hierarchy, it is advantageous to integrate application
knowledge in guiding frequent memory move, i.e., replicating or migrating virtual memory …