A survey on compiler autotuning using machine learning

AH Ashouri, W Killian, J Cavazos, G Palermo… - ACM Computing …, 2018 - dl.acm.org
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …

Learning to optimize halide with tree search and random programs

A Adams, K Ma, L Anderson, R Baghdadi… - ACM Transactions on …, 2019 - dl.acm.org
We present a new algorithm to automatically schedule Halide programs for high-
performance image processing and deep learning. We significantly improve upon the …

Opentuner: An extensible framework for program autotuning

J Ansel, S Kamil, K Veeramachaneni… - Proceedings of the 23rd …, 2014 - dl.acm.org
Program autotuning has been shown to achieve better or more portable performance in a
number of domains. However, autotuners themselves are rarely portable between projects …

Using meta-heuristics and machine learning for software optimization of parallel computing systems: a systematic literature review

S Memeti, S Pllana, A Binotto, J Kołodziej, I Brandic - Computing, 2019 - Springer
While modern parallel computing systems offer high performance, utilizing these powerful
computing resources to the highest possible extent demands advanced knowledge of …

PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation

A Klöckner, N Pinto, Y Lee, B Catanzaro, P Ivanov… - Parallel Computing, 2012 - Elsevier
High-performance computing has recently seen a surge of interest in heterogeneous
systems, with an emphasis on modern Graphics Processing Units (GPUs). These devices …

A deep learning based cost model for automatic code optimization

R Baghdadi, M Merouani… - Proceedings of …, 2021 - proceedings.mlsys.org
Enabling compilers to automatically optimize code has been a longstanding goal for the
compiler community. Efficiently solving this problem requires using precise cost models …

SPIRAL: Extreme performance portability

F Franchetti, TM Low, DT Popovici… - Proceedings of the …, 2018 - ieeexplore.ieee.org
In this paper, we address the question of how to automatically map computational kernels to
highly efficient code for a wide range of computing platforms and establish the correctness of …

Bridging the gap between deep learning and sparse matrix format selection

Y Zhao, J Li, C Liao, X Shen - Proceedings of the 23rd ACM SIGPLAN …, 2018 - dl.acm.org
This work presents a systematic exploration on the promise and special challenges of deep
learning for sparse matrix format selection---a problem of determining the best storage …

Mitigating the compiler optimization phase-ordering problem using machine learning

S Kulkarni, J Cavazos - … of the ACM international conference on Object …, 2012 - dl.acm.org
Today's compilers have a plethora of optimizations to choose from, and the correct choice of
optimizations can have a significant impact on the performance of the code being optimized …

Autotuning algorithmic choice for input sensitivity

Y Ding, J Ansel, K Veeramachaneni, X Shen… - ACM SIGPLAN …, 2015 - dl.acm.org
A daunting challenge faced by program performance autotuning is input sensitivity, where
the best autotuned configuration may vary with different input sets. This paper presents a …