Kawthar Shafie Khorassani
Verified email at amd.com
Title | Cited by | Year
NV-Group: Link-efficient reduction for distributed deep learning on modern dense GPU systems
CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda
Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020
Cited by 42 · 2020
Performance evaluation of MPI libraries on GPU-enabled OpenPOWER architectures: Early experiences
KS Khorassani, CH Chu, H Subramoni, DK Panda
High Performance Computing: ISC High Performance 2019 International …, 2019
Cited by 25 · 2019
Designing a ROCm-aware MPI library for AMD GPUs: early experiences
K Shafie Khorassani, J Hashmi, CH Chu, CC Chen, H Subramoni, ...
International Conference on High Performance Computing, 118-136, 2021
Cited by 14 · 2021
Adaptive and hierarchical large message all-to-all communication algorithms for large-scale dense GPU systems
KS Khorassani, CH Chu, QG Anthony, H Subramoni, DK Panda
2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021
Cited by 14 · 2021
Accelerating MPI all-to-all communication with online compression on modern GPU clusters
Q Zhou, P Kousha, Q Anthony, K Shafie Khorassani, A Shafi, ...
International Conference on High Performance Computing, 3-25, 2022
Cited by 13 · 2022
High-performance adaptive MPI derived datatype communication for modern Multi-GPU systems
CH Chu, JM Hashmi, KS Khorassani, H Subramoni, DK Panda
2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019
Cited by 9 · 2019
Dynamic kernel fusion for bulk non-contiguous data transfer on GPU clusters
CH Chu, KS Khorassani, Q Zhou, H Subramoni, DK Panda
2020 IEEE International Conference on Cluster Computing (CLUSTER), 130-141, 2020
Cited by 8 · 2020
High performance MPI over the Slingshot interconnect: Early experiences
K Shafie Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, ...
Practice and Experience in Advanced Research Computing, 1-7, 2022
Cited by 6 · 2022
Implementing and Optimizing a GPU-aware MPI Library for Intel GPUs: Early Experiences
CC Chen, KS Khorassani, GKR Kuncham, R Vaidya, M Abduljabbar, ...
2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023
Cited by 3 · 2023
Highly efficient alltoall and alltoallv communication algorithms for GPU systems
CC Chen, KS Khorassani, QG Anthony, A Shafi, H Subramoni, DK Panda
2022 IEEE International Parallel and Distributed Processing Symposium …, 2022
Cited by 3 · 2022
OMB-UM: Design, implementation, and evaluation of CUDA unified memory aware MPI benchmarks
KV Manian, CH Chu, AA Awan, KS Khorassani, H Subramoni, DK Panda
2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2019
Cited by 3 · 2019
Network assisted non-contiguous transfers for GPU-aware MPI libraries
KK Suresh, KS Khorassani, CC Chen, B Ramesh, M Abduljabbar, A Shafi, ...
2022 IEEE Symposium on High-Performance Interconnects (HOTI), 13-20, 2022
Cited by 2 · 2022
MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators
CC Chen, K Shafie Khorassani, P Kousha, Q Zhou, J Yao, H Subramoni, ...
Proceedings of the SC'23 Workshops of The International Conference on High …, 2023
Cited by 1 · 2023
Designing and Optimizing GPU-aware Nonblocking MPI Neighborhood Collective Communication for PETSc*
KS Khorassani, CC Chen, H Subramoni, DK Panda
2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2023
Cited by 1 · 2023
High Performance MPI over the Slingshot Interconnect
KS Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, DK Panda
Journal of Computer Science and Technology 38 (1), 128-145, 2023
2023
High Performance MPI over the Slingshot Interconnect
KS Khorassani, CC Chen, B Ramesh, A Shafi, H Subramoni, DK Panda
Journal of Computer Science and Technology 38 (1), 128-145, 2023
2023
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters
KS Khorassani, A Shafi, H Subramoni, DK Panda
High Performance Computing: 37th International Conference, ISC High …, 2022
2022
High Performance Computing: ISC High Performance 2019 International Workshops, Frankfurt, Germany, June 16-20, 2019, Revised Selected Papers
M Weiland, G Juckeland, S Alam, H Jagode
Springer Nature, 2019
2019