Efficient acceleration of deep learning inference on resource-constrained edge devices: A review
Successful integration of deep neural networks (DNNs) or deep learning (DL) has resulted
in breakthroughs in many areas. However, deploying these highly accurate models for data …
in breakthroughs in many areas. However, deploying these highly accurate models for data …
A survey of accelerator architectures for deep neural networks
Recently, due to the availability of big data and the rapid growth of computing power,
artificial intelligence (AI) has regained tremendous attention and investment. Machine …
artificial intelligence (AI) has regained tremendous attention and investment. Machine …
Machine learning at facebook: Understanding inference at the edge
At Facebook, machine learning provides a wide range of capabilities that drive many
aspects of user experience including ranking posts, content understanding, object detection …
aspects of user experience including ranking posts, content understanding, object detection …
Bit fusion: Bit-level dynamically composable architecture for accelerating deep neural network
Hardware acceleration of Deep Neural Networks (DNNs) aims to tame their enormous
compute intensity. Fully realizing the potential of acceleration in this domain requires …
compute intensity. Fully realizing the potential of acceleration in this domain requires …
[图书][B] Efficient processing of deep neural networks
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …
Recnmp: Accelerating personalized recommendation with near-memory processing
Personalized recommendation systems leverage deep learning models and account for the
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …
Mind mappings: enabling efficient algorithm-accelerator mapping space search
Modern day computing increasingly relies on specialization to satiate growing performance
and efficiency requirements. A core challenge in designing such specialized hardware …
and efficiency requirements. A core challenge in designing such specialized hardware …
Hardware acceleration of sparse and irregular tensor computations of ml models: A survey and insights
Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational-and memory-intensive applications, tensors of these …
processing these computational-and memory-intensive applications, tensors of these …
Hypar: Towards hybrid parallelism for deep learning accelerator array
With the rise of artificial intelligence in recent years, Deep Neural Networks (DNNs) have
been widely used in many domains. To achieve high performance and energy efficiency …
been widely used in many domains. To achieve high performance and energy efficiency …
Non-structured DNN weight pruning—Is it beneficial in any platform?
Large deep neural network (DNN) models pose the key challenge to energy efficiency due
to the significantly higher energy consumption of off-chip DRAM accesses than arithmetic or …
to the significantly higher energy consumption of off-chip DRAM accesses than arithmetic or …