The championship simulator: Architectural simulation for education and competition

N Gober, G Chacon, L Wang, PV Gratz… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …

Contiguitas: The pursuit of physical memory contiguity in datacenters

K Zhao, K Xue, Z Wang, D Schatzberg, L Yang… - Proceedings of the 50th …, 2023 - dl.acm.org
The unabating growth of the memory needs of emerging datacenter applications has
exacerbated the scalability bottleneck of virtual memory. However, reducing the excessive …

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …

IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations

B Li, Y Guo, Y Wang, A Jaleel, J Yang… - Proceedings of the 56th …, 2023 - dl.acm.org
Multi-GPU systems have emerged as a desirable platform to deliver high computing
capabilities and large memory capacity to accommodate large dataset sizes. However …

Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources

K Kanellopoulos, HC Nam, N Bostanci, R Bera… - Proceedings of the 56th …, 2023 - dl.acm.org
Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …

Morrigan: A composite instruction tlb prefetcher

G Vavouliotis, L Alvarez, B Grot, D Jiménez… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
The effort to reduce address translation overheads has typically targeted data accesses
since they constitute the overwhelming portion of the second-level TLB (STLB) misses in …

Accelerating Extra Dimensional Page Walks for Confidential Computing

D Du, B Yang, Y Xia, H Chen - Proceedings of the 56th Annual IEEE …, 2023 - dl.acm.org
To support highly scalable and fine-grained computing paradigms such as microservices
and serverless computing better, modern hardware-assisted confidential computing …

Page size aware cache prefetching

G Vavouliotis, G Chacon, L Alvarez… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
The increase in working set sizes of contemporary applications outpaces the growth in
cache sizes, resulting in frequent main memory accesses that deteriorate system …

A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch Filtering

AV Jamet, G Vavouliotis, DA Jiménez… - … Symposium on High …, 2024 - ieeexplore.ieee.org
To alleviate the performance and energy overheads of contemporary applications with large
data footprints, we propose the Two Level Perceptron (TLP) predictor, a neural mechanism …

Trans-fw: Short circuiting page table walk in multi-gpu systems via remote forwarding

B Li, J Yin, A Holey, Y Zhang, J Yang… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Multi-GPU systems have become a popular platform to meet the ever-growing application
demands. However, employing multiple GPUs does not guarantee proportional performance …