A flexible approach to autotuning multi-pass machine learning compilers

C Cummins, V Seeker, D Grubisic, M Elhoushi… - arXiv preprint arXiv …, 2023 - arxiv.org

We explore the novel application of Large Language Models to code optimization. We
present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly …

被引用次数：17 相关文章所有 2 个版本

[PDF] neurips.cc

TpuGraphs: A Performance Prediction Dataset on Large Tensor Computational Graphs

M Phothilimthana, S Abu-El-Haija… - Advances in …, 2024 - proceedings.neurips.cc

Precise hardware performance models play a crucial role in code optimizations. They can
assist compilers in making heuristic decisions or aid autotuners in identifying the optimal …

被引用次数：6 相关文章所有 3 个版本

[PDF] acm.org

TelaMalloc: Efficient On-Chip Memory Allocation for Production Machine Learning Accelerators

M Maas, U Beaugnon, A Chauhan, B Ilbeyi - Proceedings of the 28th …, 2022 - dl.acm.org

Memory buffer allocation for on-chip memories is a major challenge in modern machine
learning systems that target ML accelerators. In interactive systems such as mobile phones …

被引用次数：14 相关文章所有 2 个版本

[PDF] acm.org

Neural architecture search using property guided synthesis

C Jin, PM Phothilimthana, S Roy - Proceedings of the ACM on …, 2022 - dl.acm.org

Neural architecture search (NAS) has become an increasingly important tool within the deep
learning community in recent years, yielding many practical advancements in the design of …

被引用次数：7 相关文章所有 6 个版本

[PDF] arxiv.org

Compiler generated feedback for Large Language Models

D Grubisic, C Cummins, V Seeker… - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce a novel paradigm in compiler optimization powered by Large Language
Models with compiler feedback to optimize the code size of LLVM assembly. The model …

被引用次数：2 相关文章所有 2 个版本

[PDF] github.io

ALT: Breaking the Wall between Data Layout and Loop Optimizations for Deep Learning Compilation

Z Xu, J Xu, H Peng, W Wang, X Wang, H Wan… - Proceedings of the …, 2023 - dl.acm.org

Deep learning models rely on highly optimized tensor libraries for efficient inference on
heterogeneous hardware. Current deep compilers typically predetermine layouts of tensors …

被引用次数：3 相关文章所有 2 个版本

[PDF] researchgate.net

Compiler optimization prediction with new self-improved optimization model

C Shewale, SB Shinde, YB Gurav… - International …, 2023 - search.proquest.com

Users may now choose from a vast range of compiler optimizations. These optimizations
interact in a variety of sophisticated ways with one another and with the source code. The …

被引用次数：5 相关文章所有 2 个版本

[PDF] acm.org

The Droplet Search Algorithm for Kernel Scheduling

M Canesche, VM Rosario, E Borin… - ACM Transactions on …, 2023 - dl.acm.org

Kernel scheduling is the problem of finding the most efficient implementation for a
computational kernel. Identifying this implementation involves experimenting with the …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Optimizing memory mapping using deep reinforcement learning

P Wang, M Sazanovich, B Ilbeyi… - arXiv preprint arXiv …, 2023 - arxiv.org

Resource scheduling and allocation is a critical component of many high impact systems
ranging from congestion control to cloud computing. Finding more optimal solutions to these …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Mlgoperf: An ml guided inliner to optimize performance

AH Ashouri, M Elhoushi, Y Hua, X Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

For the past 25 years, we have witnessed an extensive application of Machine Learning to
the Compiler space; the selection and the phase-ordering problem. However, limited works …

被引用次数：6 相关文章所有 3 个版本