A comprehensive survey on trustworthy recommender systems

W Fan, X Zhao, X Chen, J Su, J Gao, L Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
As one of the most successful AI-powered applications, recommender systems aim to help
people make appropriate decisions in an effective and efficient way, by providing …

Sparse attention acceleration with synergistic in-memory pruning and on-chip recomputation

A Yazdanbakhsh, A Moradifirouzabadi… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
As its core computation, a self-attention mechanism gauges pairwise correlations across the
entire input sequence. Despite favorable performance, calculating pairwise correlations is …

NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing

Y Wang, S Li, Q Zheng, L Song, Z Li… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector
database and many data center applications, such as person re-identification and …

A heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems

X Shen, Y Huang, L Zheng, X Liao, H Jin - CCF Transactions on High …, 2024 - Springer
Modern recommendation systems integrate graph convolution neural networks (GCN) for
enhancing embedding representation. Compared with widely deployed neural network …

Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases

GF Oliveira, A Boroumand, S Ghose… - 2022 IEEE Computer …, 2022 - ieeexplore.ieee.org
Today's computing systems require moving data back-and-forth between computing
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …

Towards the Efficiency, Heterogeneity, and Robustness of Edge AI

B Kim, Z Du, J Sun, Y Chen - 2023 IEEE/ACM International …, 2023 - ieeexplore.ieee.org
Over the past decade, there has been a persistent trend in edge computing, driving the
migration of intelligence closer to the edge. The increasing need to process data locally has …

ReHarvest: an ADC Resource-Harvesting Crossbar Architecture for ReRAM-Based DNN Accelerators

J Xu, H Liu, Z Duan, X Liao, H Jin, X Yang, H Li… - ACM Transactions on …, 2024 - dl.acm.org
ReRAM-based Processing-In-Memory (PIM) architectures have been increasingly explored
to accelerate various Deep Neural Network (DNN) applications because they can achieve …

Benchmarking DNN Mapping Methods for the In-Memory Computing Accelerators

Y Wang, X Fong - IEEE Journal on Emerging and Selected …, 2023 - ieeexplore.ieee.org
This paper presents a study of methods for mapping the convolutional workloads in deep
neural networks (DNNs) onto the computing hardware in the in-memory computing (IMC) …

Janus: A Flexible Processing-in-Memory Graph Accelerator Towards Sparsity

X Li, Z Song, R Ausavarungnirun, X Liu… - … on Computer-Aided …, 2024 - ieeexplore.ieee.org
Graph application is ever-growing in relational data analysis. However, the memory access
patterns become the performance bottleneck in graph analytics and graph neural network …

PIMPR: PIM-based Personalized Recommendation with Heterogeneous Memory Hierarchy

T Yang, H Ma, Y Zhao, F Liu, Z He… - … Design, Automation & …, 2023 - ieeexplore.ieee.org
Deep learning-based personalized recommendation models (DLRMs) are dominating AI
tasks in data centers. The performance bottleneck of typical DLRMs mainly lies in the …