The championship simulator: Architectural simulation for education and competition
Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …
processors. This increase in complexity presents a twofold challenge for the field of …
Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making
G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
Thermometer: profile-guided btb replacement for data center applications
Modern processors employ a decoupled frontend with Fetch Directed Instruction Prefetching
(FDIP) to avoid frontend stalls in data center applications. However, the large branch …
(FDIP) to avoid frontend stalls in data center applications. However, the large branch …
Whisper: Profile-guided branch misprediction elimination for data center applications
Modern data center applications experience frequent branch mispredictions–degrading
performance, increasing cost, and reducing energy efficiency in data centers. Even the state …
performance, increasing cost, and reducing energy efficiency in data centers. Even the state …
Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources
Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …
datasets and irregular access patterns that lead to frequent high-latency page table walks …
Page size aware cache prefetching
The increase in working set sizes of contemporary applications outpaces the growth in
cache sizes, resulting in frequent main memory accesses that deteriorate system …
cache sizes, resulting in frequent main memory accesses that deteriorate system …
Utopia: Fast and Efficient Address Translation via Hybrid Restrictive & Flexible Virtual-to-Physical Address Mappings
K Kanellopoulos, R Bera, K Stojiljkovic… - Proceedings of the 56th …, 2023 - dl.acm.org
Conventional virtual memory (VM) frameworks enable a virtual address to flexibly map to
any physical address. This flexibility necessitates large data structures to store virtual-to …
any physical address. This flexibility necessitates large data structures to store virtual-to …
Rebasing microarchitectural research with industry traces
Microarchitecture research relies on performance models with various degrees of accuracy
and speed. In the past few years, one such model, ChampSim, has started to gain significant …
and speed. In the past few years, one such model, ChampSim, has started to gain significant …
The Impact of Page Size and Microarchitecture on Instruction Address Translation Overhead
As the volume of data processed by applications has increased, considerable attention has
been paid to data address translation overheads, leading to the widespread use of larger …
been paid to data address translation overheads, leading to the widespread use of larger …
Towards High Performance and Efficient Memory Deduplication via Mixed Pages
L Yao, Y Li, F Guo, S Wu, Y Xu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Large pages are widely supported in modern hardware and OSes to reduce the overhead of
TLB misses. However, memory deduplication can be inefficient with large pages, leading to …
TLB misses. However, memory deduplication can be inefficient with large pages, leading to …