The championship simulator: Architectural simulation for education and competition

N Gober, G Chacon, L Wang, PV Gratz… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …

Thermometer: profile-guided btb replacement for data center applications

S Song, TA Khan, SM Shahri, A Sriraman… - Proceedings of the 49th …, 2022 - dl.acm.org
Modern processors employ a decoupled frontend with Fetch Directed Instruction Prefetching
(FDIP) to avoid frontend stalls in data center applications. However, the large branch …

Whisper: Profile-guided branch misprediction elimination for data center applications

TA Khan, M Ugur, K Nathella, D Sunwoo… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Modern data center applications experience frequent branch mispredictions–degrading
performance, increasing cost, and reducing energy efficiency in data centers. Even the state …

Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources

K Kanellopoulos, HC Nam, N Bostanci, R Bera… - Proceedings of the 56th …, 2023 - dl.acm.org
Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …

Page size aware cache prefetching

G Vavouliotis, G Chacon, L Alvarez… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
The increase in working set sizes of contemporary applications outpaces the growth in
cache sizes, resulting in frequent main memory accesses that deteriorate system …

Utopia: Fast and Efficient Address Translation via Hybrid Restrictive & Flexible Virtual-to-Physical Address Mappings

K Kanellopoulos, R Bera, K Stojiljkovic… - Proceedings of the 56th …, 2023 - dl.acm.org
Conventional virtual memory (VM) frameworks enable a virtual address to flexibly map to
any physical address. This flexibility necessitates large data structures to store virtual-to …

Rebasing microarchitectural research with industry traces

J Feliu, A Perais, DA Jiménez… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Microarchitecture research relies on performance models with various degrees of accuracy
and speed. In the past few years, one such model, ChampSim, has started to gain significant …

The Impact of Page Size and Microarchitecture on Instruction Address Translation Overhead

Y Zhou, AL Cox, S Dwarkadas, X Dong - ACM Transactions on …, 2023 - dl.acm.org
As the volume of data processed by applications has increased, considerable attention has
been paid to data address translation overheads, leading to the widespread use of larger …

Towards High Performance and Efficient Memory Deduplication via Mixed Pages

L Yao, Y Li, F Guo, S Wu, Y Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Large pages are widely supported in modern hardware and OSes to reduce the overhead of
TLB misses. However, memory deduplication can be inefficient with large pages, leading to …