Design automation framework for application-specific logic-in-memory blocks

S Jain, A Ranjan, K Roy… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

In-memory computing is a promising approach to addressing the processor-memory data
transfer bottleneck in computing systems. We propose spin-transfer torque compute-in …

Cited by 401 Related articles All 8 versions

[PDF] cmu.edu

Accelerating sparse matrix-matrix multiplication with 3D-stacked logic-in-memory hardware

Q Zhu, T Graf, HE Sumbul, L Pileggi… - 2013 IEEE High …, 2013 - ieeexplore.ieee.org

This paper introduces a 3D-stacked logic-in-memory (LiM) system to accelerate the
processing of sparse matrix data that is held in a 3D DRAM system. We build a customized …

Cited by 138 Related articles All 10 versions

[PDF] cmu.edu

A 3D-stacked logic-in-memory accelerator for application-specific data intensive computing

Q Zhu, B Akin, HE Sumbul, F Sadi… - 2013 IEEE …, 2013 - ieeexplore.ieee.org

This paper introduces a 3D-stacked logic-in-memory (LiM) system that integrates the 3D die-
stacked DRAM architecture with the application-specific LiM IC to accelerate important data …

Cited by 151 Related articles All 9 versions

DynaSpAM: Dynamic spatial architecture mapping using out of order instruction schedules

F Liu, H Ahn, SR Beard, T Oh, DI August - Proceedings of the 42nd …, 2015 - dl.acm.org

Spatial architectures are more efficient than traditional Out-of-Order (OOO) processors for
computationally intensive programs. However, spatial architectures require mapping a …

Cited by 46 Related articles All 6 versions

[PDF] googleapis.com

System and method for in-memory computing

S Jain, A Ranjan, K Roy, A Raghunathan - US Patent 10,073,733, 2018 - Google Patents

A memory capable of carrying out compute-in-memory (CiM) operations is disclosed. The
memory includes a matrix of bit cells having a plurality of bit cells along one or more rows …

Cited by 31 Related articles All 2 versions

[PDF] date-conference.com

Computing-in-memory with spintronics

S Jain, S Sapatnekar, JP Wang, K Roy… - … , Automation & Test …, 2018 - ieeexplore.ieee.org

In-memory computing is a promising approach to alleviating the processor-memory data
transfer bottleneck in computing systems. While spintronics has attracted great interest as a …

Cited by 28 Related articles All 11 versions

[PDF] cmu.edu

PageRank acceleration for large graphs with scalable hardware and two-step SpMV

F Sadi, J Sweeney, S McMillan, TM Low… - 2018 IEEE High …, 2018 - ieeexplore.ieee.org

PageRank is an important vertex ranking algorithm that suffers from poor performance and
efficiency due to notorious memory access behavior. Furthermore, when graphs become …

Cited by 21 Related articles All 4 versions

[PDF] stanford.edu

Design and optimization of a stencil engine

JS Brunhaver II - 2015 - search.proquest.com

Application specific processors exploit the structure of algorithms to reduce energy costs and
increase performance. These kinds of optimizations have become more and more important …

Cited by 34 Related articles All 2 versions

[PDF] ieee.org

Input-aware flow-based computing on memristor crossbars with applications to edge detection

D Chakraborty, S Raj, SL Fernandes… - IEEE Journal on …, 2019 - ieeexplore.ieee.org

Sneak paths in nanoscale memristor crossbars have traditionally been viewed as a problem
in the use of memristor crossbars as non-volatile replacements of traditional volatile RAM …

Cited by 15 Related articles All 5 versions

[PDF] cmu.edu

Understanding the design space of DRAM-optimized hardware FFT accelerators

B Akın, F Franchetti, JC Hoe - 2014 IEEE 25th International …, 2014 - ieeexplore.ieee.org

As technology scaling is reaching its limits, pointing to the well-known memory and power
wall problems, achieving high-performance and energy-efficient systems is becoming a …

Cited by 29 Related articles All 10 versions