Programming-by-demonstration for long-horizon robot tasks
The goal of programmatic Learning from Demonstration (LfD) is to learn a policy in a
programming language that can be used to control a robot's behavior from a set of user …
programming language that can be used to control a robot's behavior from a set of user …
Loop rerolling for hardware decompilation
We introduce the new problem of hardware decompilation. Analogous to software
decompilation, hardware decompilation is about analyzing a low-level artifact—in this case …
decompilation, hardware decompilation is about analyzing a low-level artifact—in this case …
Hybf: A hybrid branch fusion strategy for code size reduction
Binary code size is a first-class design consideration in many computing domains and a
critical factor in many more, but compiler optimizations targeting code size are few and often …
critical factor in many more, but compiler optimizations targeting code size are few and often …
Multiple Function Merging for Code Size Reduction
Y Saito, K Sakamoto, H Washizaki… - ACM Transactions on …, 2024 - dl.acm.org
Resource-constrained environments, such as embedded devices, have limited amounts of
memory and storage. Practical programming languages such as C++ and Rust tend to …
memory and storage. Practical programming languages such as C++ and Rust tend to …
Kernel Looping: Eliminating Synchronization Boundaries for Peak Inference Performance
D Koeplinger, D Gandhi, P Nandkar, N Sheeley… - arXiv preprint arXiv …, 2024 - arxiv.org
Token generation speed is critical to power the next wave of AI inference applications. GPUs
significantly underperform during token generation due to synchronization overheads at …
significantly underperform during token generation due to synchronization overheads at …
Binary Folding Compression for Efficient Software Distribution
This paper presents a simple yet effective approach for compressing binary files by detecting
and folding similar patterns. Until now, methods for compressing these files were mostly …
and folding similar patterns. Until now, methods for compressing these files were mostly …
EAtuner: Comparative Study of Evolutionary Algorithms for Compiler Auto-tuning
G Xiao, S Qin, K Li, J Chen, J Yin - 2024 27th International …, 2024 - ieeexplore.ieee.org
The manual adjustment of compilation flags by compiler users is impractical due to the
exponential size of the search space. To address this, machine learning-based compiler …
exponential size of the search space. To address this, machine learning-based compiler …