Distributed Machine Learning through Heterogeneous Edge Systems | H Hu, D Wang, C Wu | Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7179-7186, 2020 | 39 | 2020 |
dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training | H Hu, C Jiang, Y Zhong, Y Peng, C Wu, Y Zhu, H Lin, C Guo | Proceedings of Machine Learning and Systems 4, 623-637, 2022 | 11 | 2022 |
CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs | H Hu, J Su, J Zhao, Y Peng, Y Zhu, H Lin, C Wu | Proceedings of the Nineteenth European Conference on Computer Systems, 1054-1074, 2024 | 1 | 2024 |
dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training | H Hu, C Jiang, Y Zhong, Y Peng, C Wu, Y Zhu, H Lin, C Guo | arXiv preprint arXiv:2205.02473, 2022 | 1 | 2022 |