Lingxiao Ma 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	913	900
h 指数	15	15
i10 指数	15	15

300

150

225

201820192020202120222023202412 21 60 146 209 287 174

开放获取的出版物数量

查看全部

9 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

关注

Lingxiao Ma

Senior Researcher, Microsoft Research

在 pku.edu.cn 的电子邮件经过验证 - 首页

Systems for Machine Learning GPU


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
NeuGraph: Parallel Deep Neural Network Computation on Large Graphs L Ma, Z Yang, Y Miao, J Xue, M Wu, L Zhou, Y Dai 2019 {USENIX} Annual Technical Conference ({USENIX}{ATC} 19), 443-458, 2019	251	2019
Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks L Ma, Z Xie, Z Yang, J Xue, Y Miao, W Cui, W Hu, F Yang, L Zhang, ... 14th {USENIX} Symposium on Operating Systems Design and Implementation …, 2020	111	2020
SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019	82	2019
Garaph: efficient GPU-accelerated graph processing on a single machine with balanced replication L Ma, Z Yang, H Chen, J Xue, Y Dai 2017 USENIX Annual Technical Conference (USENIX ATC 17), 195-207, 2017	78	2017
Architectural Implications of Graph Neural Networks Z Zhang, J Leng, L Ma, Y Miao, C Li, M Guo IEEE Computer Architecture Letters 19 (1), 59-62, 2020	53	2020
PCGCN: Partition-Centric Processing for Accelerating Graph Convolutional Network C Tian, L Ma, Z Yang, Y Dai 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2020	46	2020
{ROLLER}: Fast and Efficient Tensor Compilation for Deep Learning H Zhu, R Wu, Y Diao, S Ke, H Li, C Zhang, J Xue, L Ma, Y Xia, W Cui, ... 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022	45	2022
Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce X Miao, X Nie, Y Shao, Z Yang, J Jiang, L Ma, B Cui Proceedings of the 2021 International Conference on Management of Data, 2262 …, 2021	42	2021
Towards Efficient Large-Scale Graph Neural Network Computing L Ma, Z Yang, Y Miao, J Xue, M Wu, L Zhou, Y Dai arXiv preprint arXiv:1810.08403, 2018	35	2018
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits S Ma, H Wang, L Ma, L Wang, W Wang, S Huang, L Dong, R Wang, J Xue, ... arXiv preprint arXiv:2402.17764, 2024	28	2024
{SparTA}:{Deep-Learning} Model Sparsity via {Tensor-with-Sparsity-Attribute} N Zheng, B Lin, Q Zhang, L Ma, Y Yang, F Yang, Y Wang, M Yang, L Zhou 16th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2022	26	2022
Dense-to-Sparse Gate for Mixture-of-Experts X Nie, S Cao, X Miao, L Ma, J Xue, Y Miao, Z Yang, Z Yang, B Cui arXiv preprint arXiv:2112.14397, 2021	22	2021
Bitnet: Scaling 1-bit transformers for large language models H Wang, S Ma, L Dong, S Huang, H Wang, L Ma, F Yang, R Wang, Y Wu, ... arXiv preprint arXiv:2310.11453, 2023	19	2023
Evomoe: An evolutional mixture-of-experts training framework via dense-to-sparse gate X Nie, X Miao, S Cao, L Ma, Q Liu, J Xue, Y Miao, Y Liu, Z Yang, B Cui arXiv preprint arXiv:2112.14397, 2021	16	2021
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement X Nie, X Miao, Z Wang, Z Yang, J Xue, L Ma, G Cao, B Cui Proceedings of the ACM on Management of Data 1 (1), 1-19, 2023	15	2023
Optimizing Dynamic Neural Networks with Brainstorm W Cui, Z Han, L Ouyang, Y Wang, N Zheng, L Ma, Y Yang, F Yang, J Xue, ... 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023	9	2023
Accelerating GNN training with locality-aware partial execution T Kim, C Hwang, KS Park, Z Lin, P Cheng, Y Miao, L Ma, Y Xiong Proceedings of the 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 34-41, 2021	9	2021
Welder: Scheduling Deep Learning Memory Access via Tile-graph Y Shi, Z Yang, J Xue, L Ma, Y Xia, Z Miao, Y Guo, F Yang, L Zhou 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023	7	2023
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation N Zheng, H Jiang, Q Zhang, Z Han, L Ma, Y Yang, F Yang, C Zhang, L Qiu, ... Proceedings of the 29th Symposium on Operating Systems Principles, 331-347, 2023	5	2023
Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning B Lin, N Zheng, L Wang, S Cao, L Ma, Q Zhang, Y Zhu, T Cao, J Xue, ... Proceedings of Machine Learning and Systems 5, 2023	5	2023

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用