Harnessing integrated cpu-gpu system memory for hpc: a first look into grace hopper

G Schieffer, J Wahlgren, J Ren, J Faj… - Proceedings of the 53rd …, 2024 - dl.acm.org
Memory management across discrete CPU and GPU physical memory is traditionally
achieved through explicit GPU allocations and data copy or unified virtual memory. The …

GrOUT: Transparent Scale-Out to Overcome UVM's Oversubscription Slowdowns

IDD Lavore, D Maffi, M Arnaboldi… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
Hardware accelerators have always been difficult to approach. In recent years, we have
experienced great efforts to simplify their programming paradigms, especially on GPUs. This …

[图书][B] Visual Analytics Techniques for Investigating Large-Scale HPC Profiles and Trace Data

SP Kesavan - 2023 - search.proquest.com
Performance visualization is an emerging field that adapts to the growing ecosystem of High-
Performance Computing (HPC). With the continued growth in scale and complexity of HPC …

Beyond Vablock: Improving Transformer Workloads Through Aggressive Prefetching

J Rhee, I Choi, G Koo, Y Oh, MK Yoon - Available at SSRN 5007418 - papers.ssrn.com
The memory capacity constraint of GPUs is a major challenge in running large deep
learning workloads with their ever increasing memory requirements. To run a large …

[引用][C] 인공지능성능극대화를위한그래픽처리장치의발전과연구동향

이제인, 정은비, 윤명국 - 정보과학회지, 2024 - dbpia.co.kr
오늘날은 소프트웨어 개발뿐 아니라 사회 전반적으로도 가히 인공지능 (Aritifical Intelligence,
AI) 의 시대라 할 수 있다. 머신러닝 (Machine Learning) 과 딥러닝 (Deep Learning) 등의 관련 …