Efficient processing of deep neural networks: A tutorial and survey
Deep neural networks (DNNs) are currently widely used for many artificial intelligence (AI)
applications including computer vision, speech recognition, and robotics. While DNNs …
applications including computer vision, speech recognition, and robotics. While DNNs …
Flower: A friendly federated learning research framework
Federated Learning (FL) has emerged as a promising technique for edge devices to
collaboratively learn a shared prediction model, while keeping their training data on the …
collaboratively learn a shared prediction model, while keeping their training data on the …
Edge assisted real-time object detection for mobile augmented reality
Most existing Augmented Reality (AR) and Mixed Reality (MR) systems are able to
understand the 3D geometry of the surroundings but lack the ability to detect and classify …
understand the 3D geometry of the surroundings but lack the ability to detect and classify …
Three-dimensional memristor circuits as complex neural networks
Constructing a computing circuit in three dimensions (3D) is a necessary step to enable the
massive connections and efficient communications required in complex neural networks. 3D …
massive connections and efficient communications required in complex neural networks. 3D …
A configurable cloud-scale DNN processor for real-time AI
J Fowers, K Ovtcharov, M Papamichael… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
Interactive AI-powered services require low-latency evaluation of deep neural network
(DNN) models-aka"" real-time AI"". The growing demand for computationally expensive …
(DNN) models-aka"" real-time AI"". The growing demand for computationally expensive …
[HTML][HTML] Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification
Convolutional neural networks (CNNs) excel in a wide variety of computer vision
applications, but their high performance also comes at a high computational cost. Despite …
applications, but their high performance also comes at a high computational cost. Despite …
In-datacenter performance analysis of a tensor processing unit
Many architects believe that major improvements in cost-energy-performance must now
come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor …
come from domain-specific hardware. This paper evaluates a custom ASIC---called a Tensor …
[图书][B] Efficient processing of deep neural networks
This book provides a structured treatment of the key principles and techniques for enabling
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …
efficient processing of deep neural networks (DNNs). DNNs are currently widely used for …
ISAAC: A convolutional neural network accelerator with in-situ analog arithmetic in crossbars
A Shafiee, A Nag, N Muralimanohar… - ACM SIGARCH …, 2016 - dl.acm.org
A number of recent efforts have attempted to design accelerators for popular machine
learning algorithms, such as those involving convolutional and deep neural networks (CNNs …
learning algorithms, such as those involving convolutional and deep neural networks (CNNs …