Learning to optimize halide with tree search and random programs
We present a new algorithm to automatically schedule Halide programs for high-
performance image processing and deep learning. We significantly improve upon the …
performance image processing and deep learning. We significantly improve upon the …
Schedule synthesis for halide pipelines on gpus
The Halide DSL and compiler have enabled high-performance code generation for image
processing pipelines targeting heterogeneous architectures through the separation of …
processing pipelines targeting heterogeneous architectures through the separation of …
Efficient automatic scheduling of imaging and vision pipelines for the GPU
We present a new algorithm to quickly generate high-performance GPU implementations of
complex imaging and vision pipelines, directly from high-level Halide algorithm code. It is …
complex imaging and vision pipelines, directly from high-level Halide algorithm code. It is …
SlidingConv: Domain-Specific Description of Sliding Discrete Cosine Transform Convolution for Halide
Y Kanetaka, H Takagi, Y Maeda, N Fukushima - IEEE Access, 2023 - ieeexplore.ieee.org
Filtering is a fundamental tool in image processing, and its acceleration affects many
applications. Therefore, various algorithmic and hardware accelerations have been …
applications. Therefore, various algorithmic and hardware accelerations have been …
A graph neural network-based performance model for deep learning applications
The unprecedented proliferation of machine learning based software brings an ever-
increasing need to optimize the implementation of such applications. State-of-the-art …
increasing need to optimize the implementation of such applications. State-of-the-art …
Aδ autodiff for discontinuous programs - applied to shaders
Over the last decade, automatic differentiation (AD) has profoundly impacted graphics and
vision applications---both broadly via deep learning and specifically for inverse rendering …
vision applications---both broadly via deep learning and specifically for inverse rendering …
Thallo–scheduling for high-performance large-scale non-linear least-squares solvers
Large-scale optimization problems at the core of many graphics, vision, and imaging
applications are often implemented by hand in tedious and error-prone processes in order to …
applications are often implemented by hand in tedious and error-prone processes in order to …
Using graph neural networks to model the performance of deep neural networks
With the unprecedented proliferation of machine learning software, there is an ever-
increasing need to generate efficient code for such applications. State-of-the-art deep …
increasing need to generate efficient code for such applications. State-of-the-art deep …
Efficient execution of microscopy image analysis on distributed memory hybrid machines
WO Barreiros Júnior - 2024 - rlbea.unb.br
A análise de imagens de whole slide tissue image (WSIs) é uma tarefa computacionalmente
cara, impactando negativamente no uso de dados de patologia em imagens em larga …
cara, impactando negativamente no uso de dados de patologia em imagens em larga …
Machine Learning in Compiler Optimization
A Haj-Ali - 2020 - search.proquest.com
The end of Moore's law is driving the search for new techniques to improve system
performance as applications continue to evolve rapidly and computing power demands …
performance as applications continue to evolve rapidly and computing power demands …