A survey of computer architecture simulation techniques and tools

A Akram, L Sawalha - Ieee Access, 2019 - ieeexplore.ieee.org
Computer architecture simulators play an important role in advancing computer architecture
research. With wider research directions and the increased number of simulators that have …

[HTML][HTML] Estimation of energy consumption in machine learning

E García-Martín, CF Rodrigues, G Riley… - Journal of Parallel and …, 2019 - Elsevier
Energy consumption has been widely studied in the computer architecture field for decades.
While the adoption of energy as a metric in machine learning is emerging, the majority of …

Mapping techniques in multicore processors: current and future trends

M Gupta, L Bhargava, S Indu - The Journal of Supercomputing, 2021 - Springer
Multicore systems are in demand due to their high performance thus making application
mapping an important research area in this field. Breaking an application into multiple …

AccelWattch: A power modeling framework for modern GPUs

V Kandiah, S Peverelle, M Khairy, J Pan… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Graphics Processing Units (GPUs) are rapidly dominating the accelerator space, as
illustrated by their wide-spread adoption in the data analytics and machine learning markets …

Prodigy: Improving the memory latency of data-indirect irregular workloads using hardware-software co-design

N Talati, K May, A Behroozi, Y Yang… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Irregular workloads are typically bottlenecked by the memory system. These workloads often
use sparse data representations, eg, compressed sparse row/column (CSR/CSC), to …

Analysis and optimization of the memory hierarchy for graph processing workloads

A Basak, S Li, X Hu, SM Oh, X Xie… - … Symposium on High …, 2019 - ieeexplore.ieee.org
Graph processing is an important analysis technique for a wide range of big data
applications. The ability to explicitly represent relationships between entities gives graph …

Decoupled vector runahead

A Naithani, J Roelandts, S Ainsworth… - Proceedings of the 56th …, 2023 - dl.acm.org
We present Decoupled Vector Runahead (DVR), an in-core prefetching technique,
executing separately to the main application thread, that exploits massive amounts of …

Daemon: Architectural support for efficient data movement in fully disaggregated systems

C Giannoula, K Huang, J Tang, N Koziris… - Proceedings of the …, 2023 - dl.acm.org
Resource disaggregation offers a cost effective solution to resource scaling, utilization, and
failure-handling in data centers by physically separating hardware devices in a server …

Domain-specialized cache management for graph analytics

P Faldu, J Diamond, B Grot - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Graph analytics power a range of applications in areas as diverse as finance, networking
and business logistics. A common property of graphs used in the domain of graph analytics …

Machine learning-based approaches for energy-efficiency prediction and scheduling in composite cores architectures

H Sayadi, N Patel, A Sasan… - 2017 IEEE international …, 2017 - ieeexplore.ieee.org
Heterogeneous architectures offer divers computing capabilities. Composite Cores
Architecture (CCA) is a class of dynamic heterogeneous architectures that empowers the …