Compiler support for sparse tensor computations in MLIR

A Bik, P Koanantakool, T Shpeisman… - ACM Transactions on …, 2022 - dl.acm.org
Sparse tensors arise in problems in science, engineering, machine learning, and data
analytics. Programs that operate on such tensors can exploit sparsity to reduce storage …

Towards efficient sparse matrix vector multiplication on real processing-in-memory architectures

C Giannoula, I Fernandez, J Gómez-Luna… - ACM SIGMETRICS …, 2022 - dl.acm.org
Several manufacturers have already started to commercialize near-bank Processing-In-
Memory (PIM) architectures, after decades of research efforts. Near-bank PIM architectures …

Accelerating sparse MTTKRP for tensor decomposition on FPGA

S Wijeratne, TY Wang, R Kannan… - Proceedings of the 2023 …, 2023 - dl.acm.org
Sparse Matricized Tensor Times Khatri-Rao Product (spMTTKRP) is the most
computationally intensive kernel in sparse tensor decomposition. In this paper, we propose …

Efficient, out-of-memory sparse MTTKRP on massively parallel architectures

A Nguyen, AE Helal, F Checconi… - Proceedings of the 36th …, 2022 - dl.acm.org
Tensor decomposition (TD) is an important method for extracting latent information from high-
dimensional (multi-modal) sparse data. This study presents a novel framework for …

Code synthesis for sparse tensor format conversion and optimization

T Popoola, T Zhao, A St. George, K Bhetwal… - Proceedings of the 21st …, 2023 - dl.acm.org
Many scientific applications compute using sparse data and store that data in a variety of
sparse formats because each format has unique space and performance benefits …

Dynasor: A dynamic memory layout for accelerating sparse mttkrp for tensor decomposition on multi-core cpu

S Wijeratne, R Kannan… - 2023 IEEE 35th …, 2023 - ieeexplore.ieee.org
Sparse Matricized Tensor Times Khatri-Rao Prod-uct (spMTTKRP) is the most time-
consuming compute kernel in sparse tensor decomposition. In this paper, we introduce a …

Sparsity-aware tensor decomposition

SE Kurt, S Raje, A Sukumaran-Rajam… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Sparse tensor decomposition, such as Canonical Polyadic Decomposition (CPD), is a key
operation for data analytics and machine learning. Its computation is dominated by a set of …

Exploring data layout for sparse tensor times dense matrix on GPUs

K Ahmad, C Cecka, M Garland, M Hall - ACM Transactions on …, 2024 - dl.acm.org
An important sparse tensor computation is sparse-tensor-dense-matrix multiplication
(SpTM), which is used in tensor decomposition and applications. SpTM is a multi …

Accelerating Irregular Applications via Efficient Synchronization and Data Access Techniques

C Giannoula - arXiv preprint arXiv:2211.05908, 2022 - arxiv.org
Irregular applications comprise an increasingly important workload domain for many fields,
including bioinformatics, chemistry, physics, social sciences and machine learning …

Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams

Y Soh, AE Helal, F Checconi… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
Streaming tensor factorization is an effective tool for unsupervised analysis of time-evolving
sparse data, which emerge in many critical domains such as cybersecurity and trend …