Tpugraphs: A performance prediction dataset on large tensor computational graphs

M Phothilimthana, S Abu-El-Haija… - Advances in …, 2024 - proceedings.neurips.cc
Precise hardware performance models play a crucial role in code optimizations. They can
assist compilers in making heuristic decisions or aid autotuners in identifying the optimal …

WACO: learning workload-aware co-optimization of the format and schedule of a sparse tensor program

J Won, C Mendis, JS Emer… - Proceedings of the 28th …, 2023 - dl.acm.org
In this paper, we present WACO, a novel method of co-optimizing the format and the
schedule of a given sparsity pattern in a sparse tensor program. A core challenge in this …

Tenset: A large-scale program performance dataset for learned tensor compilers

L Zheng, R Liu, J Shao, T Chen… - Thirty-fifth Conference …, 2021 - openreview.net
Search-based tensor compilers can greatly accelerate the execution of machine learning
models by generating high-performance tensor programs, such as matrix multiplications and …

A flexible approach to autotuning multi-pass machine learning compilers

PM Phothilimthana, A Sabne, N Sarda… - 2021 30th …, 2021 - ieeexplore.ieee.org
Search-based techniques have been demonstrated effective in solving complex optimization
problems that arise in domain-specific compilers for machine learning (ML). Unfortunately …

Supersonic: Learning to generate source code optimizations in C/C++

Z Chen, S Fang, M Monperrus - IEEE Transactions on Software …, 2024 - ieeexplore.ieee.org
Software optimization refines programs for resource efficiency while preserving functionality.
Traditionally, it is a process done by developers and compilers. This paper introduces a third …

Tensor program optimization with probabilistic programs

J Shao, X Zhou, S Feng, B Hou, R Lai… - Advances in …, 2022 - proceedings.neurips.cc
Automatic optimization for tensor programs becomes increasingly important as we deploy
deep learning in various environments, and efficient optimization relies on a rich search …

Tlp: A deep learning-based cost model for tensor program tuning

Y Zhai, Y Zhang, S Liu, X Chu, J Peng, J Ji… - Proceedings of the 28th …, 2023 - dl.acm.org
Tensor program tuning is a non-convex objective optimization problem, to which search-
based approaches have proven to be effective. At the core of the search-based approaches …

One-shot tuner for deep learning compilers

J Ryu, E Park, H Sung - Proceedings of the 31st ACM SIGPLAN …, 2022 - dl.acm.org
Auto-tuning DL compilers are gaining ground as an optimizing back-end for DL frameworks.
While existing work can generate deep learning models that exceed the performance of …

PolyTOPS: Reconfigurable and Flexible Polyhedral Scheduler

G Consolaro, Z Zhang, H Razanajato… - 2024 IEEE/ACM …, 2024 - ieeexplore.ieee.org
Polyhedral techniques have been widely used for automatic code optimization in low-level
compilers and higher-level processes. Loop optimization is central to this technique, and …

Transfer-tuning: Reusing auto-schedules for efficient tensor program code generation

P Gibson, J Cano - Proceedings of the International Conference on …, 2022 - dl.acm.org
Auto-scheduling for tensor programs is a process where a search algorithm automatically
explores candidate schedules (program transformations) for a given program on a target …