Exploiting page table locality for agile tlb prefetching

N Gober, G Chacon, L Wang, PV Gratz… - arXiv preprint arXiv …, 2022 - arxiv.org

Recent years have seen a dramatic increase in the microarchitectural complexity of
processors. This increase in complexity presents a twofold challenge for the field of …

被引用次数：56 相关文章所有 2 个版本

[PDF] acm.org

Contiguitas: The pursuit of physical memory contiguity in datacenters

K Zhao, K Xue, Z Wang, D Schatzberg, L Yang… - Proceedings of the 50th …, 2023 - dl.acm.org

The unabating growth of the memory needs of emerging datacenter applications has
exacerbated the scalability bottleneck of virtual memory. However, reducing the excessive …

被引用次数：17 相关文章所有 12 个版本

[PDF] uiuc.edu

Micro-armed bandit: lightweight & reusable reinforcement learning for microarchitecture decision-making

G Gerogiannis, J Torrellas - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

Online Reinforcement Learning (RL) has been adopted as an effective mechanism in
various decision-making problems in microarchitecture. Its high adaptability and the ability to …

被引用次数：7 相关文章所有 5 个版本

[PDF] acm.org

IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations

B Li, Y Guo, Y Wang, A Jaleel, J Yang… - Proceedings of the 56th …, 2023 - dl.acm.org

Multi-GPU systems have emerged as a desirable platform to deliver high computing
capabilities and large memory capacity to accommodate large dataset sizes. However …

被引用次数：6 相关文章所有 8 个版本

[PDF] arxiv.org

Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources

K Kanellopoulos, HC Nam, N Bostanci, R Bera… - Proceedings of the 56th …, 2023 - dl.acm.org

Address translation is a performance bottleneck in data-intensive workloads due to large
datasets and irregular access patterns that lead to frequent high-latency page table walks …

被引用次数：8 相关文章所有 6 个版本

[PDF] acm.org

Morrigan: A composite instruction tlb prefetcher

G Vavouliotis, L Alvarez, B Grot, D Jiménez… - MICRO-54: 54th Annual …, 2021 - dl.acm.org

The effort to reduce address translation overheads has typically targeted data accesses
since they constitute the overwhelming portion of the second-level TLB (STLB) misses in …

被引用次数：20 相关文章所有 11 个版本

[PDF] sjtu.edu.cn

Accelerating Extra Dimensional Page Walks for Confidential Computing

D Du, B Yang, Y Xia, H Chen - Proceedings of the 56th Annual IEEE …, 2023 - dl.acm.org

To support highly scalable and fine-grained computing paradigms such as microservices
and serverless computing better, modern hardware-assisted confidential computing …

被引用次数：3 相关文章所有 4 个版本

[PDF] upc.edu

Page size aware cache prefetching

G Vavouliotis, G Chacon, L Alvarez… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org

The increase in working set sizes of contemporary applications outpaces the growth in
cache sizes, resulting in frequent main memory accesses that deteriorate system …

被引用次数：12 相关文章所有 5 个版本

[PDF] arxiv.org

A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch Filtering

AV Jamet, G Vavouliotis, DA Jiménez… - … Symposium on High …, 2024 - ieeexplore.ieee.org

To alleviate the performance and energy overheads of contemporary applications with large
data footprints, we propose the Two Level Perceptron (TLP) predictor, a neural mechanism …

被引用次数：5 相关文章所有 6 个版本

[PDF] nsf.gov

Trans-fw: Short circuiting page table walk in multi-gpu systems via remote forwarding

B Li, J Yin, A Holey, Y Zhang, J Yang… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Multi-GPU systems have become a popular platform to meet the ever-growing application
demands. However, employing multiple GPUs does not guarantee proportional performance …

被引用次数：10 相关文章所有 6 个版本