A survey of machine learning for computer architecture and systems
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …
Machine learning in compiler optimization
In the last decade, machine-learning-based compilation has moved from an obscure
research niche to a mainstream activity. In this paper, we describe the relationship between …
research niche to a mainstream activity. In this paper, we describe the relationship between …
Cross-architecture performance prediction (XAPP) using CPU code to predict GPU performance
N Ardalani, C Lestourgeon, K Sankaralingam… - Proceedings of the 48th …, 2015 - dl.acm.org
GPUs have become prevalent and more general purpose, but GPU programming remains
challenging and time consuming for the majority of programmers. In addition, it is not always …
challenging and time consuming for the majority of programmers. In addition, it is not always …
A mechanistic performance model for superscalar out-of-order processors
S Eyerman, L Eeckhout, T Karkhanis… - ACM Transactions on …, 2009 - dl.acm.org
A mechanistic model for out-of-order superscalar processors is developed and then applied
to the study of microarchitecture resource scaling. The model divides execution time into …
to the study of microarchitecture resource scaling. The model divides execution time into …
Power-performance modeling on asymmetric multi-cores
M Pricopi, TS Muthukaruppan… - … and Synthesis for …, 2013 - ieeexplore.ieee.org
Asymmetric multi-core architectures have recently emerged as a promising alternative in a
power and thermal constrained environment. They typically integrate cores with different …
power and thermal constrained environment. They typically integrate cores with different …
Respir: A response surface-based pareto iterative refinement for application-specific design space exploration
Application-specific multiprocessor systems-on-chip (MPSoCs) are usually designed by
using a platform-based approach, where a wide range of customizable parameters can be …
using a platform-based approach, where a wide range of customizable parameters can be …
Stargazer: Automated regression-based GPU design space exploration
Graphics processing units (GPUs) are of increasing interest because they offer massive
parallelism for high-throughput computing. While GPUs promise high peak performance …
parallelism for high-throughput computing. While GPUs promise high peak performance …
Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making
G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
A predictive model for dynamic microarchitectural adaptivity control
Adaptive micro architectures are a promising solution for designing high-performance,
power-efficient microprocessors. They offer the ability to tailor computational resources to …
power-efficient microprocessors. They offer the ability to tailor computational resources to …
Accurate phase-level cross-platform power and performance estimation
Fast and accurate performance and power prediction is a key challenge in co-development
of hardware and software. Traditional analytical or simulation-based approaches are often …
of hardware and software. Traditional analytical or simulation-based approaches are often …