CAFQA: A classical simulation bootstrap for variational quantum algorithms
Classical computing plays a critical role in the advancement of quantum frontiers in the NISQ
era. In this spirit, this work uses classical simulation to bootstrap Variational Quantum …
era. In this spirit, this work uses classical simulation to bootstrap Variational Quantum …
Automating reinforcement learning architecture design for code optimization
Reinforcement learning (RL) is emerging as a powerful technique for solving complex code
optimization tasks with an ample search space. While promising, existing solutions require a …
optimization tasks with an ample search space. While promising, existing solutions require a …
ytopt: Autotuning scientific applications for energy efficiency at large scales
As we enter the exascale computing era, efficiently utilizing power and optimizing the
performance of scientific applications under power and energy constraints has become …
performance of scientific applications under power and energy constraints has become …
Ribbon: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances
Deep learning model inference is a key service in many businesses and scientific discovery
processes. This paper introduces Ribbon, a novel deep learning inference serving system …
processes. This paper introduces Ribbon, a novel deep learning inference serving system …
Scalable Tuning of (OpenMP) GPU Applications via Kernel Record and Replay
HPC is a heterogeneous world in which host and device code are interleaved throughout
the application. Given the significant performance advantage of accelerators, device code …
the application. Given the significant performance advantage of accelerators, device code …
Harnessing the crowd for autotuning high-performance computing applications
This paper presents GPTuneCrowd, a crowd-based autotuning framework for tuning high-
performance computing applications. GPTuneCrowd collects performance data from various …
performance computing applications. GPTuneCrowd collects performance data from various …
Performance optimization using multimodal modeling and heterogeneous gnn
Growing heterogeneity and configurability in HPC architectures has made auto-tuning
applications and runtime parameters on these systems very complex. Users are presented …
applications and runtime parameters on these systems very complex. Users are presented …
Baco: A fast and portable Bayesian compiler optimization framework
We introduce the Bayesian Compiler Optimization framework (BaCO), a general purpose
autotuner for modern compilers targeting CPUs, GPUs, and FPGAs. BaCO provides the …
autotuner for modern compilers targeting CPUs, GPUs, and FPGAs. BaCO provides the …
Phronesis: Efficient performance modeling for high-dimensional configuration tuning
Y Li, BC Lee - ACM Transactions on Architecture and Code …, 2022 - dl.acm.org
We present Phronesis, a learning framework for efficiently modeling the performance of data
analytic workloads as a function of their high-dimensional software configuration …
analytic workloads as a function of their high-dimensional software configuration …
[HTML][HTML] Automated linear solver selection for simulation of multiphysics processes in porous media
Y Zabegaev, E Keilegavlen, E Iversen, I Berre - Computer Methods in …, 2024 - Elsevier
Porous media processes involve various physical phenomena such as mechanical
deformation, transport, and fluid flow. Accurate simulations must capture the strong …
deformation, transport, and fluid flow. Accurate simulations must capture the strong …