MIPD: An adaptive gradient sparsification framework for distributed DNNs training Z Zhang, C Wang IEEE Transactions on Parallel and Distributed Systems 33 (11), 3053-3066, 2022 | 12 | 2022 |
FPGA-based High-Performance Collision Detection: An Enabling Technique for Image-Guided Robotic Surgery Z Zhang, Xin Y, Liu B, Li WXY, Lee KH, Ng CF, Stoyanov D, Cheung RCC, Kwok KW Frontiers in Robotics and AI, 2016 | 9* | 2016 |
C-coll: Introducing error-bounded lossy compression into mpi collectives J Huang, S Di, X Yu, Y Zhai, J Liu, K Raffenetti, H Zhou, K Zhao, Z Chen, ... arXiv preprint arXiv:2304.03890, 2023 | 8 | 2023 |
SaPus: Self-adaptive parameter update strategy for DNN training on Multi-GPU clusters Z Zhang, C Wang IEEE Transactions on Parallel and Distributed Systems 33 (7), 1569-1580, 2021 | 5 | 2021 |
国家高性能计算环境发展报告: 2002-2017 年 迟学斌 科学出版社, 2018 | 5 | 2018 |
An application specific instruction set processor (asip) for adaptive filters in neural prosthetics Y Xin, WXY Li, Z Zhang, RCC Cheung, D Song, TW Berger IEEE/ACM Transactions on Computational Biology and Bioinformatics 12 (5 …, 2015 | 5 | 2015 |
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression J Huang, S Di, X Yu, Y Zhai, Z Zhang, J Liu, X Lu, K Raffenetti, H Zhou, ... 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 4 | 2024 |
Momentum-driven adaptive synchronization model for distributed DNN training on HPC clusters Z Zhang, Z Ji, C Wang Journal of Parallel and Distributed Computing 159, 65-84, 2022 | 3 | 2022 |
Development Report on National High Performance Computing Environment (2002-2017)[M] XB Chi Science Press, 51-113, 2018 | 2 | 2018 |
FedFa: A Fully Asynchronous Training Paradigm for Federated Learning H Xu, Z Zhang, S Di, B Liu, A Khalid, J Cao arXiv preprint arXiv:2404.11015, 2024 | | 2024 |
A Survey on Error-Bounded Lossy Compression for Scientific Datasets S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ... arXiv preprint arXiv:2404.02840, 2024 | | 2024 |
POSTER: Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs Z Ji, Z Zhang, J Xu, L Ju Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | | 2024 |
An Optimized Error-controlled MPI Collective Framework Integrated with Lossy Compression Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Zhaorui Zhang, Jinyang Liu ... IPDPS: 2024 38th IEEE International Parallel & Distributed Processing Symposium, 2023 | | 2023 |
Accelerating High-Precision Integer Multiplication used in Cryptosystems with GPUs Zhuoran Ji, Zhaorui Zhang, Jiming Xu, Lei Ju PPoPP: ACM SIGPLAN Symposium on Principles and Practice of Parallel …, 2023 | | 2023 |
Efficient parameter update strategy for distributed deep learning system Z Zhang HKU Theses Online (HKUTO), 2021 | | 2021 |