Rerec: In-reram acceleration with access-aware mapping for personalized recommendation

W Fan, X Zhao, X Chen, J Su, J Gao, L Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

As one of the most successful AI-powered applications, recommender systems aim to help
people make appropriate decisions in an effective and efficient way, by providing …

被引用次数：42 相关文章所有 3 个版本

[PDF] arxiv.org

Sparse attention acceleration with synergistic in-memory pruning and on-chip recomputation

A Yazdanbakhsh, A Moradifirouzabadi… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org

As its core computation, a self-attention mechanism gauges pairwise correlations across the
entire input sequence. Despite favorable performance, calculating pairwise correlations is …

被引用次数：24 相关文章所有 5 个版本

NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing

Y Wang, S Li, Q Zheng, L Song, Z Li… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org

Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector
database and many data center applications, such as person re-identification and …

A heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems

X Shen, Y Huang, L Zheng, X Liao, H Jin - CCF Transactions on High …, 2024 - Springer

Modern recommendation systems integrate graph convolution neural networks (GCN) for
enhancing embedding representation. Compared with widely deployed neural network …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases

GF Oliveira, A Boroumand, S Ghose… - 2022 IEEE Computer …, 2022 - ieeexplore.ieee.org

Today's computing systems require moving data back-and-forth between computing
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …

被引用次数：6 相关文章所有 10 个版本

Towards the Efficiency, Heterogeneity, and Robustness of Edge AI

B Kim, Z Du, J Sun, Y Chen - 2023 IEEE/ACM International …, 2023 - ieeexplore.ieee.org

Over the past decade, there has been a persistent trend in edge computing, driving the
migration of intelligence closer to the edge. The increasing need to process data locally has …

被引用次数：1 相关文章

[PDF] acm.org Full View

ReHarvest: an ADC Resource-Harvesting Crossbar Architecture for ReRAM-Based DNN Accelerators

J Xu, H Liu, Z Duan, X Liao, H Jin, X Yang, H Li… - ACM Transactions on …, 2024 - dl.acm.org

ReRAM-based Processing-In-Memory (PIM) architectures have been increasingly explored
to accelerate various Deep Neural Network (DNN) applications because they can achieve …

被引用次数：1 相关文章

Benchmarking DNN Mapping Methods for the In-Memory Computing Accelerators

Y Wang, X Fong - IEEE Journal on Emerging and Selected …, 2023 - ieeexplore.ieee.org

This paper presents a study of methods for mapping the convolutional workloads in deep
neural networks (DNNs) onto the computing hardware in the in-memory computing (IMC) …

被引用次数：1 相关文章所有 3 个版本

Janus: A Flexible Processing-in-Memory Graph Accelerator Towards Sparsity

X Li, Z Song, R Ausavarungnirun, X Liu… - … on Computer-Aided …, 2024 - ieeexplore.ieee.org

Graph application is ever-growing in relational data analysis. However, the memory access
patterns become the performance bottleneck in graph analytics and graph neural network …

PIMPR: PIM-based Personalized Recommendation with Heterogeneous Memory Hierarchy

T Yang, H Ma, Y Zhao, F Liu, Z He… - … Design, Automation & …, 2023 - ieeexplore.ieee.org

Deep learning-based personalized recommendation models (DLRMs) are dominating AI
tasks in data centers. The performance bottleneck of typical DLRMs mainly lies in the …

被引用次数：2 相关文章