The championship simulator: Architectural simulation for education and competition
Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …
processors. This increase in complexity presents a twofold challenge for the field of …
Contiguitas: The pursuit of physical memory contiguity in datacenters
The unabating growth of the memory needs of emerging datacenter applications has
exacerbated the scalability bottleneck of virtual memory. However, reducing the excessive …
exacerbated the scalability bottleneck of virtual memory. However, reducing the excessive …
Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making
G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
various decision-making problems in microarchitecture. Its high adaptability and the ability to …
IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations
Multi-GPU systems have emerged as a desirable platform to deliver high computing
capabilities and large memory capacity to accommodate large dataset sizes. However …
capabilities and large memory capacity to accommodate large dataset sizes. However …
Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources
Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …
datasets and irregular access patterns that lead to frequent high-latency page table walks …
Morrigan: A composite instruction tlb prefetcher
The effort to reduce address translation overheads has typically targeted data accesses
since they constitute the overwhelming portion of the second-level TLB (STLB) misses in …
since they constitute the overwhelming portion of the second-level TLB (STLB) misses in …
Accelerating Extra Dimensional Page Walks for Confidential Computing
To support highly scalable and fine-grained computing paradigms such as microservices
and serverless computing better, modern hardware-assisted confidential computing …
and serverless computing better, modern hardware-assisted confidential computing …
Page size aware cache prefetching
The increase in working set sizes of contemporary applications outpaces the growth in
cache sizes, resulting in frequent main memory accesses that deteriorate system …
cache sizes, resulting in frequent main memory accesses that deteriorate system …
A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch Filtering
To alleviate the performance and energy overheads of contemporary applications with large
data footprints, we propose the Two Level Perceptron (TLP) predictor, a neural mechanism …
data footprints, we propose the Two Level Perceptron (TLP) predictor, a neural mechanism …
Trans-fw: Short circuiting page table walk in multi-gpu systems via remote forwarding
Multi-GPU systems have become a popular platform to meet the ever-growing application
demands. However, employing multiple GPUs does not guarantee proportional performance …
demands. However, employing multiple GPUs does not guarantee proportional performance …