[HTML][HTML] Survey of Deep Learning Accelerators for Edge and Emerging Computing
The unprecedented progress in artificial intelligence (AI), particularly in deep learning
algorithms with ubiquitous internet connected smart devices, has created a high demand for …
algorithms with ubiquitous internet connected smart devices, has created a high demand for …
AM4: MRAM Crossbar Based CAM/TCAM/ACAM/AP for In-Memory Computing
In-memory computing seeks to minimize data movement and alleviate the memory wall by
computing in-situ, in the same place that the data is located. One of the key emerging …
computing in-situ, in the same place that the data is located. One of the key emerging …
pluto: Enabling massively parallel computation in dram via lookup tables
Data movement between the main memory and the processor is a key contributor to
execution time and energy consumption in memory-intensive applications. This data …
execution time and energy consumption in memory-intensive applications. This data …
AIDA: Associative in-memory deep learning accelerator
This work presents an associative in-memory deep learning processor (AIDA) for edge
devices. An associative processor is a massively parallel non-von Neumann accelerator that …
devices. An associative processor is a massively parallel non-von Neumann accelerator that …
Accelerating database analytic query workloads using an associative processor
Database analytic query workloads are heavy consumers of data-center cycles, and there is
constant demand to improve their performance. Associative processors (AP) have re …
constant demand to improve their performance. Associative processors (AP) have re …
A low-energy DMTJ-based ternary content-addressable memory with reliable sub-nanosecond search operation
In this paper, we propose an energy-efficient, reliable, hybrid, 10-transistor/2-Double-Barrier-
Magnetic-Tunnel-Junction (10T2DMTJ) non-volatile (NV) ternary content-addressable …
Magnetic-Tunnel-Junction (10T2DMTJ) non-volatile (NV) ternary content-addressable …
Exploiting Similarity Opportunities of Emerging Vision AI Models on Hybrid Bonding Architecture
While extensive research has focused on optimizing performance and efficiency in vision-
based AI accelerators, an unexplored phenomenon, Clustering Similarity Effect, presents a …
based AI accelerators, an unexplored phenomenon, Clustering Similarity Effect, presents a …
EVE: Ephemeral vector engines
There has been a resurgence of interest in vector architectures evident by recent adoption of
vector extensions in mainstream instruction set architectures. Traditionally, vector engines …
vector extensions in mainstream instruction set architectures. Traditionally, vector engines …
Designing Precharge-Free Energy-Efficient Content-Addressable Memories
Content-addressable memory (CAM) is a specialized type of memory that facilitates
massively parallel comparison of a search pattern against its entire content. State-of-the-art …
massively parallel comparison of a search pattern against its entire content. State-of-the-art …
Comprehensive Benchmarking of Binary Neural Networks on NVM Crossbar Architectures
Non-volatile memory (NVM) crossbars have been identified as a promising technology, for
accelerating important machine learning operations, with matrix-vector multiplication being a …
accelerating important machine learning operations, with matrix-vector multiplication being a …