Scaling for edge inference of deep neural networks

X Xu, Y Ding, SX Hu, M Niemier, J Cong, Y Hu… - Nature Electronics, 2018 - nature.com
Deep neural networks offer considerable potential across a range of applications, from
advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the …

A domain-specific supercomputer for training deep neural networks

NP Jouppi, DH Yoon, G Kurian, S Li, N Patil… - Communications of the …, 2020 - dl.acm.org
Communications of the ACM, vol. 63, no. 7, July 2020, p. 67. DOI:10.1145/3360307.
Google’s TPU …

In-datacenter performance analysis of a tensor processing unit

NP Jouppi, C Young, N Patil, D Patterson… - Proceedings of the 44th …, 2017 - dl.acm.org
Many architects believe that major improvements in cost-energy-performance must now
come from domain-specific hardware. This paper evaluates a custom ASIC, called a Tensor …

Motivation for and evaluation of the first tensor processing unit

N Jouppi, C Young, N Patil, D Patterson - IEEE Micro, 2018 - ieeexplore.ieee.org
The first-generation tensor processing unit (TPU) runs deep neural network (DNN) inference
15-30 times faster with 30-80 times better energy efficiency than contemporary CPUs and …

Graphlab: A new framework for parallel machine learning

Y Low, JE Gonzalez, A Kyrola, D Bickson… - arXiv preprint arXiv …, 2014 - arxiv.org
Designing and implementing efficient, provably correct parallel machine learning (ML)
algorithms is challenging. Existing high-level parallel abstractions like MapReduce are …

Algorithm-based fault tolerance for matrix operations

KH Huang, JA Abraham - IEEE transactions on computers, 1984 - ieeexplore.ieee.org
The rapid progress in VLSI technology has reduced the cost of hardware, allowing multiple
copies of low-cost processors to provide a large amount of computational capability for a …

[BOOK][B] Models of computation

JE Savage - 1998 - dna.caltech.edu
Models of Computation. John E Savage, Computer Science, Brown University. CBSSS 2004,
July 16, 2004 …

[BOOK][B] Algorithmen und datenstrukturen

P Widmayer, T Ottmann - 1993 - Springer
At the center of interest in computer science, the field of algorithms and data
structures has developed considerably in recent years. This concerns both the …

The cube-connected cycles: a versatile network for parallel computation

FP Preparata, J Vuillemin - Communications of the ACM, 1981 - dl.acm.org
An interconnection pattern of processing elements, the cube-connected cycles (CCC), is
introduced which can be used as a general purpose parallel processor. Because its design …

[BOOK][B] Parallel computation: models and methods

SG Akl - 1997 - dl.acm.org