FPGA-based accelerators of deep learning networks for learning and classification: A review
Due to recent advances in digital technologies, and availability of credible data, an area of
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
artificial intelligence, deep learning, has emerged and has demonstrated its ability and …
Scalable deep learning on distributed infrastructures: Challenges, techniques, and tools
R Mayer, HA Jacobsen - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Deep Learning (DL) has had an immense success in the recent past, leading to state-of-the-
art results in various domains, such as image recognition and natural language processing …
art results in various domains, such as image recognition and natural language processing …
Neurosurgeon: Collaborative intelligence between the cloud and mobile edge
The computation for today's intelligent personal assistants such as Apple Siri, Google Now,
and Microsoft Cortana, is performed in the cloud. This cloud-only approach requires …
and Microsoft Cortana, is performed in the cloud. This cloud-only approach requires …
ISAAC: A convolutional neural network accelerator with in-situ analog arithmetic in crossbars
A Shafiee, A Nag, N Muralimanohar… - ACM SIGARCH …, 2016 - dl.acm.org
A number of recent efforts have attempted to design accelerators for popular machine
learning algorithms, such as those involving convolutional and deep neural networks (CNNs …
learning algorithms, such as those involving convolutional and deep neural networks (CNNs …
The architectural implications of autonomous driving: Constraints and acceleration
Autonomous driving systems have attracted a significant amount of interest recently, and
many industry leaders, such as Google, Uber, Tesla, and Mobileye, have invested a large …
many industry leaders, such as Google, Uber, Tesla, and Mobileye, have invested a large …
Benchmarking TPU, GPU, and CPU platforms for deep learning
Training deep learning models is compute-intensive and there is an industry-wide trend
towards hardware specialization to improve performance. To systematically benchmark …
towards hardware specialization to improve performance. To systematically benchmark …
A cloud-scale acceleration architecture
Hyperscale datacenter providers have struggled to balance the growing need for
specialized hardware (efficiency) with the economic benefits of homogeneity …
specialized hardware (efficiency) with the economic benefits of homogeneity …
Serving heterogeneous machine learning models on {Multi-GPU} servers with {Spatio-Temporal} sharing
As machine learning (ML) techniques are applied to a widening range of applications, high
throughput ML inference serving has become critical for online services. Such ML inference …
throughput ML inference serving has become critical for online services. Such ML inference …
Fused-layer CNN accelerators
Deep convolutional neural networks (CNNs) are rapidly becoming the dominant approach to
computer vision and a major component of many other pervasive machine learning tasks …
computer vision and a major component of many other pervasive machine learning tasks …
From high-level deep neural models to FPGAs
Deep Neural Networks (DNNs) are compute-intensive learning models with growing
applicability in a wide range of domains. FPGAs are an attractive choice for DNNs since they …
applicability in a wide range of domains. FPGAs are an attractive choice for DNNs since they …