A survey of machine learning for computer architecture and systems

N Wu, Y Xie - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

Research challenges in parallel and distributed simulation

RM Fujimoto - ACM Transactions on Modeling and Computer …, 2016 - dl.acm.org
The parallel and distributed simulation field has evolved and grown from its origins in the
1970s and 1980s and remains an active field of research to this day. A brief overview of …

Machine learning in compiler optimization

Z Wang, M O'Boyle - Proceedings of the IEEE, 2018 - ieeexplore.ieee.org
In the last decade, machine-learning-based compilation has moved from an obscure
research niche to a mainstream activity. In this paper, we describe the relationship between …

Hybrid MPI/OpenMP parallel programming on clusters of multi-core SMP nodes

R Rabenseifner, G Hager, G Jost - 2009 17th Euromicro …, 2009 - ieeexplore.ieee.org
Today most systems in high-performance computing (HPC) feature a hierarchical hardware
design: Shared memory nodes with several multi-core CPUs are connected via a network …

Selecting stars: The k most representative skyline operator

X Lin, Y Yuan, Q Zhang, Y Zhang - 2007 IEEE 23rd …, 2006 - ieeexplore.ieee.org
Skyline computation has many applications including multi-criteria decision making. In this
paper, we study the problem of selecting k skyline points so that the number of points, which …

Exploring hardware overprovisioning in power-constrained, high performance computing

T Patki, DK Lowenthal, B Rountree, M Schulz… - Proceedings of the 27th …, 2013 - dl.acm.org
Most recent research in power-aware supercomputing has focused on making individual
nodes more efficient and measuring the results in terms of flops per watt. While this work is …

A simplified and accurate model of power-performance efficiency on emergent GPU architectures

S Song, C Su, B Rountree… - 2013 IEEE 27th …, 2013 - ieeexplore.ieee.org
Emergent heterogeneous systems must be optimized for both power and performance at
exascale. Massive parallelism combined with complex memory hierarchies form a barrier to …

Hybrid MPI/OpenMP power-aware computing

D Li, BR de Supinski, M Schulz… - … on Parallel & …, 2010 - ieeexplore.ieee.org
Power-aware execution of parallel programs is now a primary concern in large-scale HPC
environments. Prior research in this area has explored models and algorithms based on …

Predicting performance impact of DVFS for realistic memory systems

R Miftakhutdinov, E Ebrahimi… - 2012 45th Annual IEEE …, 2012 - ieeexplore.ieee.org
Dynamic voltage and frequency scaling (DVFS) can make modern processors more power
and energy efficient if we can accurately predict the effect of frequency scaling on processor …

A reconfiguration algorithm for power-aware parallel applications

D De Sensi, M Torquati, M Danelutto - ACM Transactions on Architecture …, 2016 - dl.acm.org
In current computing systems, many applications require guarantees on their maximum
power consumption to not exceed the available power budget. On the other hand, for some …