A survey of machine learning for computer architecture and systems

N Wu, Y Xie - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

Machine learning in compiler optimization

Z Wang, M O'Boyle - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org
In the last decade, machine-learning-based compilation has moved from an obscure
research niche to a mainstream activity. In this paper, we describe the relationship between …

Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance

N Ardalani, C Lestourgeon, K Sankaralingam… - Proceedings of the 48th …, 2015 - dl.acm.org
GPUs have become prevalent and more general purpose, but GPU programming remains
challenging and time consuming for the majority of programmers. In addition, it is not always …

A mechanistic performance model for superscalar out-of-order processors

S Eyerman, L Eeckhout, T Karkhanis… - ACM Transactions on …, 2009 - dl.acm.org
A mechanistic model for out-of-order superscalar processors is developed and then applied
to the study of microarchitecture resource scaling. The model divides execution time into …

Power-performance modeling on asymmetric multi-cores

M Pricopi, TS Muthukaruppan… - … and Synthesis for …, 2013 - ieeexplore.ieee.org
Asymmetric multi-core architectures have recently emerged as a promising alternative in a
power and thermal constrained environment. They typically integrate cores with different …

Respir: A response surface-based pareto iterative refinement for application-specific design space exploration

G Palermo, C Silvano, V Zaccaria - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
Application-specific multiprocessor systems-on-chip (MPSoCs) are usually designed by
using a platform-based approach, where a wide range of customizable parameters can be …

Stargazer: Automated regression-based GPU design space exploration

W Jia, KA Shaw, M Martonosi - 2012 IEEE International …, 2012 - ieeexplore.ieee.org
Graphics processing units (GPUs) are of increasing interest because they offer massive
parallelism for high-throughput computing. While GPUs promise high peak performance …

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …

A predictive model for dynamic microarchitectural adaptivity control

C Dubach, TM Jones, EV Bonilla… - 2010 43rd Annual …, 2010 - ieeexplore.ieee.org
Adaptive micro architectures are a promising solution for designing high-performance,
power-efficient microprocessors. They offer the ability to tailor computational resources to …

Accurate phase-level cross-platform power and performance estimation

X Zheng, LK John, A Gerstlauer - Proceedings of the 53rd Annual Design …, 2016 - dl.acm.org
Fast and accurate performance and power prediction is a key challenge in co-development
of hardware and software. Traditional analytical or simulation-based approaches are often …