[HTML][HTML] Survey of Deep Learning Accelerators for Edge and Emerging Computing

S Alam, C Yakopcic, Q Wu, M Barnell, S Khan… - Electronics, 2024 - mdpi.com
The unprecedented progress in artificial intelligence (AI), particularly in deep learning
algorithms with ubiquitous internet connected smart devices, has created a high demand for …

AM4: MRAM Crossbar Based CAM/TCAM/ACAM/AP for In-Memory Computing

E Garzón, M Lanuzza, A Teman… - IEEE Journal on …, 2023 - ieeexplore.ieee.org
In-memory computing seeks to minimize data movement and alleviate the memory wall by
computing in-situ, in the same place that the data is located. One of the key emerging …

pluto: Enabling massively parallel computation in dram via lookup tables

JD Ferreira, G Falcao, J Gómez-Luna… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Data movement between the main memory and the processor is a key contributor to
execution time and energy consumption in memory-intensive applications. This data …

AIDA: Associative in-memory deep learning accelerator

E Garzón, A Teman, M Lanuzza, L Yavits - IEEE Micro, 2022 - ieeexplore.ieee.org
This work presents an associative in-memory deep learning processor (AIDA) for edge
devices. An associative processor is a massively parallel non-von Neumann accelerator that …

Accelerating database analytic query workloads using an associative processor

H Caminal, Y Chronis, T Wu, JM Patel… - Proceedings of the 49th …, 2022 - dl.acm.org
Database analytic query workloads are heavy consumers of data-center cycles, and there is
constant demand to improve their performance. Associative processors (AP) have re …

A low-energy DMTJ-based ternary content-addressable memory with reliable sub-nanosecond search operation

E Garzón, L Yavits, G Finocchio, M Carpentieri… - IEEE …, 2023 - ieeexplore.ieee.org
In this paper, we propose an energy-efficient, reliable, hybrid, 10-transistor/2-Double-Barrier-
Magnetic-Tunnel-Junction (10T2DMTJ) non-volatile (NV) ternary content-addressable …

Exploiting Similarity Opportunities of Emerging Vision AI Models on Hybrid Bonding Architecture

Z Yue, H Wang, J Fang, J Deng, G Lu… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
While extensive research has focused on optimizing performance and efficiency in vision-
based AI accelerators, an unexplored phenomenon, Clustering Similarity Effect, presents a …

EVE: Ephemeral vector engines

K Al-Hawaj, T Ta, N Cebry, S Agwa… - … Symposium on High …, 2023 - ieeexplore.ieee.org
There has been a resurgence of interest in vector architectures evident by recent adoption of
vector extensions in mainstream instruction set architectures. Traditionally, vector engines …

Designing Precharge-Free Energy-Efficient Content-Addressable Memories

R Taco, E Garzón, R Hanhan, A Teman… - … Transactions on Very …, 2024 - ieeexplore.ieee.org
Content-addressable memory (CAM) is a specialized type of memory that facilitates
massively parallel comparison of a search pattern against its entire content. State-of-the-art …

Comprehensive Benchmarking of Binary Neural Networks on NVM Crossbar Architectures

R Huang, Z Yue, C Huang, J Matai, Z Zhang - arXiv preprint arXiv …, 2023 - arxiv.org
Non-volatile memory (NVM) crossbars have been identified as a promising technology, for
accelerating important machine learning operations, with matrix-vector multiplication being a …