Ganax: A unified mimd-simd acceleration for generative adversarial networks

MMH Shuvo, SK Islam, J Cheng… - Proceedings of the …, 2022 - ieeexplore.ieee.org

Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …

被引用次数：78 相关文章所有 5 个版本

[PDF] sciencedirect.com

A survey of accelerator architectures for deep neural networks

Y Chen, Y Xie, L Song, F Chen, T Tang - Engineering, 2020 - Elsevier

Recently, due to the availability of big data and the rapid growth of computing power,
artificial intelligence (AI) has regained tremendous attention and investment. Machine …

被引用次数：311 相关文章所有 4 个版本

[PDF] umich.edu

Machine learning at facebook: Understanding inference at the edge

CJ Wu, D Brooks, K Chen, D Chen… - … symposium on high …, 2019 - ieeexplore.ieee.org

At Facebook, machine learning provides a wide range of capabilities that drive many
aspects of user experience including ranking posts, content understanding, object detection …

被引用次数：534 相关文章所有 6 个版本

[PDF] arxiv.org

Bit fusion: Bit-level dynamically composable architecture for accelerating deep neural network

H Sharma, J Park, N Suda, L Lai… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org

Hardware acceleration of Deep Neural Networks (DNNs) aims to tame their enormous
compute intensity. Fully realizing the potential of acceleration in this domain requires …

被引用次数：600 相关文章所有 13 个版本

[PDF] mit.edu

[图书][B] Efficient processing of deep neural networks

V Sze, YH Chen, TJ Yang, JS Emer - 2020 - Springer

This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …

被引用次数：259 相关文章所有 6 个版本

[PDF] arxiv.org

Recnmp: Accelerating personalized recommendation with near-memory processing

L Ke, U Gupta, BY Cho, D Brooks… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org

Personalized recommendation systems leverage deep learning models and account for the
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …

被引用次数：210 相关文章所有 11 个版本

[PDF] acm.org

Mind mappings: enabling efficient algorithm-accelerator mapping space search

K Hegde, PA Tsai, S Huang, V Chandra… - Proceedings of the 26th …, 2021 - dl.acm.org

Modern day computing increasingly relies on specialization to satiate growing performance
and efficiency requirements. A core challenge in designing such specialized hardware …

被引用次数：93 相关文章所有 7 个版本

[PDF] ieee.org

Hardware acceleration of sparse and irregular tensor computations of ml models: A survey and insights

S Dave, R Baghdadi, T Nowatzki… - Proceedings of the …, 2021 - ieeexplore.ieee.org

Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational-and memory-intensive applications, tensors of these …

被引用次数：89 相关文章所有 7 个版本

[PDF] arxiv.org

Hypar: Towards hybrid parallelism for deep learning accelerator array

L Song, J Mao, Y Zhuo, X Qian, H Li… - 2019 IEEE international …, 2019 - ieeexplore.ieee.org

With the rise of artificial intelligence in recent years, Deep Neural Networks (DNNs) have
been widely used in many domains. To achieve high performance and energy efficiency …

被引用次数：129 相关文章所有 8 个版本

[PDF] arxiv.org

Non-structured DNN weight pruning—Is it beneficial in any platform?

X Ma, S Lin, S Ye, Z He, L Zhang… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Large deep neural network (DNN) models pose the key challenge to energy efficiency due
to the significantly higher energy consumption of off-chip DRAM accesses than arithmetic or …

被引用次数：100 相关文章所有 10 个版本