UCX: an open source framework for HPC network APIs and beyond P Shamis, MG Venkata, MG Lopez, MB Baker, O Hernandez, Y Itigin, ... 2015 IEEE 23rd Annual Symposium on High-Performance Interconnects, 40-43, 2015 | 195 | 2015 |
Cross-channel network operation offloading for collective operations N Bloch, G Bloch, A Shachar, H Chapman, I Rabinovitz, P Shamis, ... US Patent 8,811,417, 2014 | 184 | 2014 |
ConnectX-2 InfiniBand management queues: First investigation of the new support for network offloaded collective operations RL Graham, S Poole, P Shamis, G Bloch, N Bloch, H Chapman, M Kagan, ... 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid …, 2010 | 68 | 2010 |
Overlapping computation and communication: Barrier algorithms and ConnectX-2 CORE-Direct capabilities RL Graham, S Poole, P Shamis, G Bloch, N Bloch, H Chapman, M Kagan, ... 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 59 | 2010 |
Cheetah: A framework for scalable hierarchical collective operations R Graham, MG Venkata, J Ladd, P Shamis, I Rabinovitz, V Filipov, ... 2011 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2011 | 56 | 2011 |
Connectx-2 core-direct enabled asynchronous broadcast collective communications MG Venkata, RL Graham, JS Ladd, P Shamis, I Rabinovitz, V Filipov, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 27 | 2011 |
Designing a high performance openshmem implementation using universal common communication substrate as a communication middleware P Shamis, MG Venkata, S Poole, A Welch, T Curtis OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools …, 2014 | 24 | 2014 |
X-SRQ - Improving Scalability and Performance of Multi-core InfiniBand Clusters GM Shipman, S Poole, P Shamis, I Rabinovitz Recent Advances in Parallel Virtual Machine and Message Passing Interface …, 2008 | 20 | 2008 |
Optimizing blocking and nonblocking reduction operations for multicore systems: Hierarchical design and implementation MG Venkata, P Shamis, R Sampath, RL Graham, JS Ladd 2013 IEEE International Conference on Cluster Computing (CLUSTER), 1-8, 2013 | 19 | 2013 |
Exploring OpenSHMEM model to program GPU-based extreme-scale systems S Potluri, D Rossetti, D Becker, D Poole, M Gorentla Venkata, ... OpenSHMEM and Related Technologies. Experiences, Implementations, and …, 2015 | 18 | 2015 |
OpenSHMEM-UCX: evaluation of UCX for implementing OpenSHMEM programming model M Baker, F Aderholdt, MG Venkata, P Shamis OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid …, 2016 | 17 | 2016 |
Extending the OpenSHMEM memory model to support user-defined spaces A Welch, S Pophale, P Shamis, O Hernandez, S Poole, B Chapman Proceedings of the 8th International Conference on Partitioned Global …, 2014 | 17 | 2014 |
Network Offloaded Hierarchical Collectives Using ConnectX-2’s CORE-Direct Capabilities I Rabinovitz, P Shamis, RL Graham, N Bloch, G Shainer Recent Advances in the Message Passing Interface: 17th European MPI Users …, 2010 | 17 | 2010 |
Cross-channel network operation offloading for collective operations N Bloch, G Bloch, A Shachar, H Chapman, I Rabinobitz, P Shamis, ... US Patent 9,344,490, 2016 | 14 | 2016 |
Breaking band: A breakdown of high-performance communication R Zambre, M Grodowitz, A Chandramowlishwaran, P Shamis Proceedings of the 48th International Conference on Parallel Processing, 1-10, 2019 | 12 | 2019 |
The co-design architecture for exascale systems, a novel approach for scalable designs G Shainer, T Wilde, P Lui, T Liu, M Kagan, M Dubman, Y Shahar, ... Computer Science-Research and Development 28, 119-125, 2013 | 11 | 2013 |
Memory access monitoring GW Blake, P Shamis US Patent 10,649,684, 2020 | 10 | 2020 |
Using arm scalable vector extension to optimize open mpi D Zhong, P Shamis, Q Cao, G Bosilca, S Sumimoto, K Miura, J Dongarra 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020 | 10 | 2020 |
Fault tolerance for openshmem P Hao, P Shamis, MG Venkata, S Pophale, A Welch, S Poole, B Chapman Proceedings of the 8th International Conference on Partitioned Global …, 2014 | 10 | 2014 |
Exploring the all-to-all collective optimization space with connectx core-direct MG Venkata, RL Graham, J Ladd, P Shamis 2012 41st International Conference on Parallel Processing, 289-298, 2012 | 10 | 2012 |