Ansor: Generating {High-Performance} tensor programs for deep learning L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali, Y Wang, J Yang, D Zhuo, ... 14th USENIX symposium on operating systems design and implementation (OSDI …, 2020 | 375 | 2020 |
Improving the performance of distributed tensorflow with RDMA C Jia, J Liu, X Jin, H Lin, H An, W Han, Z Wu, M Chi International Journal of Parallel Programming 46, 674-685, 2018 | 43 | 2018 |
Degree-of-node task scheduling of fine-grained parallel programs on heterogeneous systems H Lin, MF Li, CF Jia, JN Liu, H An Journal of Computer Science and Technology 34, 1096-1108, 2019 | 17 | 2019 |
An effective method for operations placement in tensor flow J Liu, C Jia, J Chen, H Lin, X Jin, H An Proceedings of the 3rd International Conference on High Performance …, 2019 | 2 | 2019 |
异构系统上基于加权出度的细粒度并行程序任务调度 H Lin, MF Li, CF Jia, JN Liu, H An 计算机科学技术学报 34 (5), 1096-1108, 2019 | | 2019 |