CACTI-P: Architecture-level modeling for SRAM-based structures with advanced leakage reduction...

H Sharma, J Park, N Suda, L Lai… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org

Hardware acceleration of Deep Neural Networks (DNNs) aims to tame their enormous
compute intensity. Fully realizing the potential of acceleration in this domain requires …

被引用次数：644 相关文章所有 13 个版本

[PDF] acm.org

Tetris: Scalable and efficient neural network acceleration with 3d memory

M Gao, J Pu, X Yang, M Horowitz… - Proceedings of the Twenty …, 2017 - dl.acm.org

The high accuracy of deep neural networks (NNs) has led to the development of NN
accelerators that improve performance by two orders of magnitude. However, scaling these …

被引用次数：692 相关文章所有 7 个版本

[PDF] mit.edu

Accelergy: An architecture-level energy estimation methodology for accelerator designs

YN Wu, JS Emer, V Sze - 2019 IEEE/ACM International …, 2019 - ieeexplore.ieee.org

With Moore's law slowing down and Dennard scaling ended, energy-efficient domain-
specific accelerators, such as deep neural network (DNN) processors for machine learning …

被引用次数：254 相关文章所有 10 个版本

[PDF] nsf.gov

Planaria: Dynamic architecture fission for spatial multi-tenant acceleration of deep neural networks

S Ghodrati, BH Ahn, JK Kim, S Kinzer… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org

Deep Neural Networks (DNNs) have reinvigorated real-world applications that rely on
learning patterns of data and are permeating into different industries and markets. Cloud …

被引用次数：125 相关文章所有 9 个版本

[PDF] nsf.gov

Snapea: Predictive early activation for reducing computation in deep convolutional neural networks

V Akhlaghi, A Yazdanbakhsh, K Samadi… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org

Deep Convolutional Neural Networks (CNNs) perform billions of operations for classifying a
single input. To reduce these computations, this paper offers a solution that leverages a …

被引用次数：197 相关文章所有 10 个版本

Emerging monolithic 3D integration: Opportunities and challenges from the computer system perspective

Y Cheng, X Guo, VF Pavlidis - Integration, 2022 - Elsevier

In the past decade, monolithic three dimensional integrated circuits (M3D-ICs) advance fast
and demonstrate several important breakthroughs in the fabrication process and circuit level …

被引用次数：22 相关文章所有 2 个版本

[PDF] ntnu.no

Efficient invisible speculative execution through selective delay and value prediction

C Sakalis, S Kaxiras, A Ros, A Jimborean… - Proceedings of the 46th …, 2019 - dl.acm.org

Speculative execution, the base on which modern high-performance general-purpose CPUs
are built on, has recently been shown to enable a slew of security attacks. All these attacks …

被引用次数：135 相关文章所有 12 个版本

[PDF] acm.org

The McPAT framework for multicore and manycore architectures: Simultaneously modeling power, area, and timing

S Li, JH Ahn, RD Strong, JB Brockman… - ACM Transactions on …, 2013 - dl.acm.org

This article introduces McPAT, an integrated power, area, and timing modeling framework
that supports comprehensive design space exploration for multicore and manycore …

被引用次数：274 相关文章所有 5 个版本

[PDF] arxiv.org

Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations

K Kanellopoulos, N Vijaykumar, C Giannoula… - Proceedings of the …, 2019 - dl.acm.org

Important workloads, such as machine learning and graph analytics applications, heavily
involve sparse linear algebra operations. These operations use sparse matrix compression …

被引用次数：112 相关文章所有 6 个版本

[PDF] ed.ac.uk

The mondrian data engine

M Drumond, A Daglis, N Mirzadeh, D Ustiugov… - ACM SIGARCH …, 2017 - dl.acm.org

The increasing demand for extracting value out of ever-growing data poses an ongoing
challenge to system designers, a task only made trickier by the end of Dennard scaling. As …

被引用次数：151 相关文章所有 7 个版本