Hardware approximate techniques for deep neural network accelerators: A survey

G Armeniakos, G Zervakis, D Soudris… - ACM Computing …, 2022 - dl.acm.org
Deep Neural Networks (DNNs) are very popular because of their high performance in
various cognitive tasks in Machine Learning (ML). Recent advancements in DNNs have …

ALWANN: Automatic layer-wise approximation of deep neural network accelerators without retraining

V Mrazek, Z Vasícek, L Sekanina… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
The state-of-the-art approaches employ approximate computing to reduce the energy
consumption of DNN hardware. Approximate DNNs then require extensive retraining …

Deep neural network approximation for custom hardware: Where we've been, where we're going

E Wang, JJ Davis, R Zhao, HC Ng, X Niu… - ACM Computing …, 2019 - dl.acm.org
Deep neural networks have proven to be particularly effective in visual and audio
recognition tasks. Existing models tend to be computationally expensive and memory …

[图书][B] Efficient processing of deep neural networks

V Sze, YH Chen, TJ Yang, JS Emer - 2020 - Springer
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …

Weight-oriented approximation for energy-efficient neural network inference accelerators

ZG Tasoulas, G Zervakis… - … on Circuits and …, 2020 - ieeexplore.ieee.org
Current research in the area of Neural Networks (NN) has resulted in performance
advancements for a variety of complex problems. Especially, embedded system applications …

[图书][B] Ristretto: Hardware-oriented approximation of convolutional neural networks

PM Gysel - 2016 - search.proquest.com
Convolutional neural networks (CNN) have achieved major breakthroughs in recent years.
Their performance in computer vision have matched and in some areas even surpassed …

A^ 3: Accelerating attention mechanisms in neural networks with approximation

TJ Ham, SJ Jung, S Kim, YH Oh, Y Park… - … Symposium on High …, 2020 - ieeexplore.ieee.org
With the increasing computational demands of the neural networks, many hardware
accelerators for the neural networks have been proposed. Such existing neural network …

Cnvlutin: Ineffectual-neuron-free deep neural network computing

J Albericio, P Judd, T Hetherington, T Aamodt… - ACM SIGARCH …, 2016 - dl.acm.org
This work observes that a large fraction of the computations performed by Deep Neural
Networks (DNNs) are intrinsically ineffectual as they involve a multiplication where one of …

A survey on the optimization of neural network accelerators for micro-ai on-device inference

AN Mazumder, J Meng, HA Rashid… - IEEE Journal on …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) are being prototyped for a variety of artificial intelligence (AI)
tasks including computer vision, data analytics, robotics, etc. The efficacy of DNNs coincides …

Proteus: Exploiting numerical precision variability in deep neural networks

P Judd, J Albericio, T Hetherington… - Proceedings of the …, 2016 - dl.acm.org
This work exploits the tolerance of Deep Neural Networks (DNNs) to reduced precision
numerical representations and specifically, their recently demonstrated ability to tolerate …