A survey of machine learning for computer architecture and systems

N Wu, Y Xie - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

Predictive performance modeling for distributed batch processing using black box monitoring and machine learning

C Witt, M Bux, W Gusew, U Leser - Information Systems, 2019 - Elsevier
In many domains, the previous decade was characterized by increasing data volumes and
growing complexity of data analyses, creating new demands for batch processing on …

Machine learning in compiler optimization

Z Wang, M O'Boyle - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org
In the last decade, machine-learning-based compilation has moved from an obscure
research niche to a mainstream activity. In this paper, we describe the relationship between …

Data-efficient performance learning for configurable systems

J Guo, D Yang, N Siegmund, S Apel, A Sarkar… - Empirical Software …, 2018 - Springer
Many software systems today are configurable, offering customization of functionality by
feature selection. Understanding how performance varies in terms of feature selection is key …

Using automated performance modeling to find scalability bugs in complex codes

A Calotoiu, T Hoefler, M Poke, F Wolf - Proceedings of the International …, 2013 - dl.acm.org
Many parallel applications suffer from latent performance limitations that may prevent them
from scaling to larger machine sizes. Often, such scalability bugs manifest themselves only …

A regression-based approach to scalability prediction

BJ Barnes, B Rountree, DK Lowenthal… - Proceedings of the …, 2008 - dl.acm.org
Many applied scientific domains are increasingly relying on large-scale parallel
computation. Consequently, many large clusters now have thousands of processors …

Prediction models for multi-dimensional power-performance optimization on many cores

M Curtis-Maury, A Shah, F Blagojevic… - Proceedings of the 17th …, 2008 - dl.acm.org
Power has become a primary concern for HPC systems. Dynamic voltage and frequency
scaling (DVFS) and dynamic concurrency throttling (DCT) are two software tools (or knobs) …

Benchmarking machine learning methods for performance modeling of scientific applications

P Malakar, P Balaprakash… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
Performance modeling is an important and active area of research in high-performance
computing (HPC). It helps in better job scheduling and also improves overall performance of …

Phantom: predicting performance of parallel applications on large-scale parallel machines using a single node

J Zhai, W Chen, W Zheng - ACM sigplan notices, 2010 - dl.acm.org
For designers of large-scale parallel computers, it is greatly desired that performance of
parallel applications can be predicted at the design phase. However, this is difficult because …

Respir: A response surface-based pareto iterative refinement for application-specific design space exploration

G Palermo, C Silvano, V Zaccaria - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Application-specific multiprocessor systems-on-chip (MPSoCs) are usually designed by
using a platform-based approach, where a wide range of customizable parameters can be …