Phi-3 technical report: A highly capable language model locally on your phone M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 2024 | 89 | 2024 |
DeepSpeed-Chat: Easy, fast and affordable RLHF training of ChatGPT-like models at all scales Z Yao, RY Aminabadi, O Ruwase, S Rajbhandari, X Wu, AA Awan, ... arXiv preprint arXiv:2308.01320, 2023 | 36 | 2023 |
Swift machine learning model serving scheduling: a region based reinforcement learning approach H Qin, S Zawad, Y Zhou, L Yang, D Zhao, F Yan Proceedings of the International Conference for High Performance Computing …, 2019 | 29 | 2019 |
The age of correlated features in supervised learning based forecasting MKC Shisher, H Qin, L Yang, F Yan, Y Sun IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops …, 2021 | 17 | 2021 |
ZeRO++: Extremely efficient collective communication for giant model training G Wang, H Qin, SA Jacobs, C Holmes, S Rajbhandari, O Ruwase, F Yan, ... arXiv preprint arXiv:2306.10209, 2023 | 16 | 2023 |
Reinforcement-learning-empowered MLaaS scheduling for serving intelligent internet of things H Qin, S Zawad, Y Zhou, S Padhi, L Yang, F Yan IEEE Internet of Things Journal 7 (7), 6325-6337, 2020 | 16 | 2020 |
Nemo: An open-source transformer-supercharged benchmark for fine-grained wildfire smoke detection A Yazdi, H Qin, CB Jordan, L Yang, F Yan Remote Sensing 14 (16), 3979, 2022 | 12 | 2022 |
DeepSpeed-FastGen: High-throughput text generation for LLMs via MII and DeepSpeed-Inference C Holmes, M Tanaka, M Wyatt, AA Awan, J Rasley, S Rajbhandari, ... arXiv preprint arXiv:2401.08671, 2024 | 6 | 2024 |
SimiGrad: Fine-grained adaptive batching for large scale training using gradient similarity measurement H Qin, S Rajbhandari, O Ruwase, F Yan, L Yang, Y He Advances in Neural Information Processing Systems 34, 20531-20544, 2021 | 4 | 2021 |
ZeRO++: Extremely Efficient Collective Communication for Large Model Training G Wang, H Qin, SA Jacobs, X Wu, C Holmes, Z Yao, S Rajbhandari, ... The Twelfth International Conference on Learning Representations, 2024 | 2 | 2024 |
Scalable and Efficient Machine Learning as a Service H Qin University of Nevada, Reno, 2022 | | 2022 |
The Age of Correlated Features in Supervised Learning based Forecasting M Kamran Chowdhury Shisher, H Qin, L Yang, F Yan, Y Sun arXiv e-prints, arXiv:2103.00092, 2021 | | 2021 |