A survey on compiler autotuning using machine learning
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …
approaches to solve a number of different compiler optimization problems. These …
Learning to optimize halide with tree search and random programs
We present a new algorithm to automatically schedule Halide programs for high-
performance image processing and deep learning. We significantly improve upon the …
performance image processing and deep learning. We significantly improve upon the …
Opentuner: An extensible framework for program autotuning
Program autotuning has been shown to achieve better or more portable performance in a
number of domains. However, autotuners themselves are rarely portable between projects …
number of domains. However, autotuners themselves are rarely portable between projects …
Using meta-heuristics and machine learning for software optimization of parallel computing systems: a systematic literature review
While modern parallel computing systems offer high performance, utilizing these powerful
computing resources to the highest possible extent demands advanced knowledge of …
computing resources to the highest possible extent demands advanced knowledge of …
PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation
High-performance computing has recently seen a surge of interest in heterogeneous
systems, with an emphasis on modern Graphics Processing Units (GPUs). These devices …
systems, with an emphasis on modern Graphics Processing Units (GPUs). These devices …
A deep learning based cost model for automatic code optimization
R Baghdadi, M Merouani… - Proceedings of …, 2021 - proceedings.mlsys.org
Enabling compilers to automatically optimize code has been a longstanding goal for the
compiler community. Efficiently solving this problem requires using precise cost models …
compiler community. Efficiently solving this problem requires using precise cost models …
SPIRAL: Extreme performance portability
In this paper, we address the question of how to automatically map computational kernels to
highly efficient code for a wide range of computing platforms and establish the correctness of …
highly efficient code for a wide range of computing platforms and establish the correctness of …
Bridging the gap between deep learning and sparse matrix format selection
This work presents a systematic exploration on the promise and special challenges of deep
learning for sparse matrix format selection---a problem of determining the best storage …
learning for sparse matrix format selection---a problem of determining the best storage …
Mitigating the compiler optimization phase-ordering problem using machine learning
S Kulkarni, J Cavazos - … of the ACM international conference on Object …, 2012 - dl.acm.org
Today's compilers have a plethora of optimizations to choose from, and the correct choice of
optimizations can have a significant impact on the performance of the code being optimized …
optimizations can have a significant impact on the performance of the code being optimized …
Autotuning algorithmic choice for input sensitivity
A daunting challenge faced by program performance autotuning is input sensitivity, where
the best autotuned configuration may vary with different input sets. This paper presents a …
the best autotuned configuration may vary with different input sets. This paper presents a …