A comprehensive survey on trustworthy recommender systems
As one of the most successful AI-powered applications, recommender systems aim to help
people make appropriate decisions in an effective and efficient way, by providing …
people make appropriate decisions in an effective and efficient way, by providing …
Sparse attention acceleration with synergistic in-memory pruning and on-chip recomputation
A Yazdanbakhsh, A Moradifirouzabadi… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
As its core computation, a self-attention mechanism gauges pairwise correlations across the
entire input sequence. Despite favorable performance, calculating pairwise correlations is …
entire input sequence. Despite favorable performance, calculating pairwise correlations is …
NDSEARCH: Accelerating Graph-Traversal-Based Approximate Nearest Neighbor Search through Near Data Processing
Approximate nearest neighbor search (ANNS) is a key retrieval technique for vector
database and many data center applications, such as person re-identification and …
database and many data center applications, such as person re-identification and …
A heterogeneous 3-D stacked PIM accelerator for GCN-based recommender systems
Modern recommendation systems integrate graph convolution neural networks (GCN) for
enhancing embedding representation. Compared with widely deployed neural network …
enhancing embedding representation. Compared with widely deployed neural network …
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Today's computing systems require moving data back-and-forth between computing
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …
resources (eg, CPUs, GPUs, accelerators) and off-chip main memory so that computation …
Towards the Efficiency, Heterogeneity, and Robustness of Edge AI
Over the past decade, there has been a persistent trend in edge computing, driving the
migration of intelligence closer to the edge. The increasing need to process data locally has …
migration of intelligence closer to the edge. The increasing need to process data locally has …
ReHarvest: an ADC Resource-Harvesting Crossbar Architecture for ReRAM-Based DNN Accelerators
ReRAM-based Processing-In-Memory (PIM) architectures have been increasingly explored
to accelerate various Deep Neural Network (DNN) applications because they can achieve …
to accelerate various Deep Neural Network (DNN) applications because they can achieve …
Benchmarking DNN Mapping Methods for the In-Memory Computing Accelerators
This paper presents a study of methods for mapping the convolutional workloads in deep
neural networks (DNNs) onto the computing hardware in the in-memory computing (IMC) …
neural networks (DNNs) onto the computing hardware in the in-memory computing (IMC) …
Janus: A Flexible Processing-in-Memory Graph Accelerator Towards Sparsity
Graph application is ever-growing in relational data analysis. However, the memory access
patterns become the performance bottleneck in graph analytics and graph neural network …
patterns become the performance bottleneck in graph analytics and graph neural network …
PIMPR: PIM-based Personalized Recommendation with Heterogeneous Memory Hierarchy
Deep learning-based personalized recommendation models (DLRMs) are dominating AI
tasks in data centers. The performance bottleneck of typical DLRMs mainly lies in the …
tasks in data centers. The performance bottleneck of typical DLRMs mainly lies in the …