Describing videos using multi-modal fusion Q Jin, J Chen, S Chen, Y Xiong, A Hauptmann Proceedings of the 24th ACM international conference on Multimedia, 1087-1091, 2016 | 119 | 2016 |
{HiveD}: Sharing a {GPU} cluster for deep learning with guarantees H Zhao, Z Han, Z Yang, Q Zhang, F Yang, L Zhou, M Yang, FCM Lau, ... 14th USENIX symposium on operating systems design and implementation (OSDI …, 2020 | 80 | 2020 |
Tutel: Adaptive mixture-of-experts at scale C Hwang, W Cui, Y Xiong, Z Yang, Z Liu, H Hu, Z Wang, R Salas, J Jose, ... Proceedings of Machine Learning and Systems 5, 2023 | 65* | 2023 |
Mscclang: Microsoft collective communication language M Cowan, S Maleki, M Musuvathi, O Saarikivi, Y Xiong Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 24* | 2023 |
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning D Gu, Y Zhao, Y Zhong, Y Xiong, Z Han, P Cheng, F Yang, G Huang, X Jin, ... Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 17 | 2023 |
Fp8-lm: Training fp8 large language models H Peng, K Wu, Y Wei, G Zhao, Y Yang, Z Liu, Y Xiong, Z Yang, B Ni, J Hu, ... arXiv preprint arXiv:2310.18313, 2023 | 14 | 2023 |
Moneo: Monitoring fine-grained metrics nonintrusively in AI infrastructure Y Jiang, Y Xiong, L Qu, CL Luo, C Tian, P Cheng, Y Xiong ACM SIGOPS Operating Systems Review 56 (1), 18-25, 2022 | 6* | 2022 |
Semantic image profiling for historic events: Linking images to phrases J Chen, Q Jin, Y Xiong Proceedings of the 24th ACM international conference on Multimedia, 1028-1037, 2016 | 3 | 2016 |
Mixture-of-experts layer with dynamic gating Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ... US Patent App. 18/054,451, 2024 | | 2024 |
Mixture-of-experts layer with switchable parallel modes Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ... US Patent App. 18/054,446, 2024 | | 2024 |
Collective communication phases at mixture-of-experts layer Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ... US Patent App. 18/054,452, 2024 | | 2024 |
Sparse encoding and decoding at mixture-of-experts layer Y Xiong, C Hwang, W Cui, Y Ziyue, Z Liu, H Hu, Z Wang, RO Salas, J Jose, ... US Patent App. 18/318,436, 2024 | | 2024 |
{SuperBench}: Improving Cloud {AI} Infrastructure Reliability with Proactive Validation Y Xiong, Y Jiang, Z Yang, L Qu, G Zhao, S Liu, D Zhong, B Pinzur, ... 2024 USENIX Annual Technical Conference (USENIX ATC 24), 835-850, 2024 | | 2024 |
History Rhyme: Searching Historic Events by Multimedia Knowledge Y Xiong, J Chen, Q Jin, C Zhang Proceedings of the 24th ACM international conference on Multimedia, 749-751, 2016 | | 2016 |