作者
Yining Shi, Zhi Yang, Jilong Xue, Lingxiao Ma, Yuqing Xia, Ziming Miao, Yuxiao Guo, Fan Yang, Lidong Zhou
发表日期
2023
研讨会论文
17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23)
页码范围
701-718
简介
With the growing demand for processing higher fidelity data and the use of faster computing cores in newer hardware accelerators, modern deep neural networks (DNNs) are becoming increasingly memory intensive. A disparity between underutilized computing cores and saturated memory bandwidth has been observed in various popular DNN models. This inefficiency is caused by both the conventional treatment of DNNs as compute-intensive workloads and the lack of holistic memory access optimization in DNN models.
引用总数
学术搜索中的文章
Y Shi, Z Yang, J Xue, L Ma, Y Xia, Z Miao, Y Guo… - 17th USENIX Symposium on Operating Systems …, 2023