Learning to optimize halide with tree search and random programs

A Adams, K Ma, L Anderson, R Baghdadi… - ACM Transactions on …, 2019 - dl.acm.org
We present a new algorithm to automatically schedule Halide programs for high-
performance image processing and deep learning. We significantly improve upon the …

Schedule synthesis for halide pipelines on gpus

S Sioutas, S Stuijk, T Basten, H Corporaal… - ACM Transactions on …, 2020 - dl.acm.org
The Halide DSL and compiler have enabled high-performance code generation for image
processing pipelines targeting heterogeneous architectures through the separation of …

Efficient automatic scheduling of imaging and vision pipelines for the GPU

L Anderson, A Adams, K Ma, TM Li, T Jin… - Proceedings of the …, 2021 - dl.acm.org
We present a new algorithm to quickly generate high-performance GPU implementations of
complex imaging and vision pipelines, directly from high-level Halide algorithm code. It is …

SlidingConv: Domain-Specific Description of Sliding Discrete Cosine Transform Convolution for Halide

Y Kanetaka, H Takagi, Y Maeda, N Fukushima - IEEE Access, 2023 - ieeexplore.ieee.org
Filtering is a fundamental tool in image processing, and its acceleration affects many
applications. Therefore, various algorithmic and hardware accelerations have been …

A graph neural network-based performance model for deep learning applications

S Singh, J Hegarty, H Leather, B Steiner - Proceedings of the 6th ACM …, 2022 - dl.acm.org
The unprecedented proliferation of machine learning based software brings an ever-
increasing need to optimize the implementation of such applications. State-of-the-art …

Aδ autodiff for discontinuous programs - applied to shaders

Y Yang, C Barnes, A Adams, A Finkelstein - ACM Transactions on …, 2022 - dl.acm.org
Over the last decade, automatic differentiation (AD) has profoundly impacted graphics and
vision applications---both broadly via deep learning and specifically for inverse rendering …

Thallo–scheduling for high-performance large-scale non-linear least-squares solvers

M Mara, F Heide, M Zollhöfer, M Nießner… - ACM Transactions on …, 2021 - dl.acm.org
Large-scale optimization problems at the core of many graphics, vision, and imaging
applications are often implemented by hand in tedious and error-prone processes in order to …

Using graph neural networks to model the performance of deep neural networks

S Singh, B Steiner, J Hegarty, H Leather - arXiv preprint arXiv:2108.12489, 2021 - arxiv.org
With the unprecedented proliferation of machine learning software, there is an ever-
increasing need to generate efficient code for such applications. State-of-the-art deep …

Efficient execution of microscopy image analysis on distributed memory hybrid machines

WO Barreiros Júnior - 2024 - rlbea.unb.br
A análise de imagens de whole slide tissue image (WSIs) é uma tarefa computacionalmente
cara, impactando negativamente no uso de dados de patologia em imagens em larga …

Machine Learning in Compiler Optimization

A Haj-Ali - 2020 - search.proquest.com
The end of Moore's law is driving the search for new techniques to improve system
performance as applications continue to evolve rapidly and computing power demands …