NV-Group: link-efficient reduction for distributed deep learning on modern dense GPU systems CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020 | 42 | 2020 |
Performance evaluation of MPI libraries on GPU-enabled OpenPOWER architectures: Early experiences KS Khorassani, CH Chu, H Subramoni, DK Panda High Performance Computing: ISC High Performance 2019 International …, 2019 | 25 | 2019 |
Designing a ROCm-aware MPI library for AMD GPUs: early experiences K Shafie Khorassani, J Hashmi, CH Chu, CC Chen, H Subramoni, ... International Conference on High Performance Computing, 118-136, 2021 | 14 | 2021 |
Adaptive and hierarchical large message all-to-all communication algorithms for large-scale dense GPU systems KS Khorassani, CH Chu, QG Anthony, H Subramoni, DK Panda 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021 | 14 | 2021 |
Accelerating MPI all-to-all communication with online compression on modern GPU clusters Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ... International Conference on High Performance Computing, 3-25, 2022 | 13 | 2022 |
High-performance adaptive MPI derived datatype communication for modern Multi-GPU systems CH Chu, JM Hashmi, KS Khorassani, H Subramoni, DK Panda 2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019 | 9 | 2019 |
Dynamic kernel fusion for bulk non-contiguous data transfer on GPU clusters CH Chu, KS Khorassani, Q Zhou, H Subramoni, DK Panda 2020 IEEE International Conference on Cluster Computing (CLUSTER), 130-141, 2020 | 8 | 2020 |
High performance MPI over the Slingshot interconnect: Early experiences K Shafie Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, ... Practice and Experience in Advanced Research Computing, 1-7, 2022 | 6 | 2022 |
Implementing and Optimizing a GPU-aware MPI Library for Intel GPUs: Early Experiences CC Chen, KS Khorassani, GKR Kuncham, R Vaidya, M Abduljabbar, ... 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023 | 3 | 2023 |
Highly efficient alltoall and alltoallv communication algorithms for GPU systems CC Chen, KS Khorassani, QG Anthony, A Shafi, H Subramoni, DK Panda 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 3 | 2022 |
OMB-UM: Design, implementation, and evaluation of CUDA unified memory aware MPI benchmarks KV Manian, CH Chu, AA Awan, KS Khorassani, H Subramoni, DK Panda 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019 | 3 | 2019 |
Network assisted non-contiguous transfers for GPU-aware MPI libraries KK Suresh, KS Khorassani, CC Chen, B Ramesh, M Abduljabbar, A Shafi, ... 2022 IEEE Symposium on High-Performance Interconnects (HOTI), 13-20, 2022 | 2 | 2022 |
MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ... Proceedings of the SC'23 Workshops of The International Conference on High …, 2023 | 1 | 2023 |
Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc* KS Khorassani, CC Chen, H Subramoni, DK Panda 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023 | 1 | 2023 |
High Performance MPI over the Slingshot Interconnect KS Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, DK Panda Journal of Computer Science and Technology 38 (1), 128-145, 2023 | | 2023 |
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters KS Khorassani, A Shafi, H Subramoni, DK Panda High Performance Computing: 37th International Conference, ISC High …, 2022 | | 2022 |
High Performance Computing: ISC High Performance 2019 International Workshops, Frankfurt, Germany, June 16-20, 2019, Revised Selected Papers M Weiland, G Juckeland, S Alam, H Jagode Springer Nature, 2019 | | 2019 |