Accelerating architectural simulation via statistical techniques: A survey

Q Guo, T Chen, Y Chen… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
In computer architecture research and development, simulation is a powerful way of
acquiring and predicting processor behaviors. While architectural simulation has been …

Principal kernel analysis: A tractable methodology to simulate scaled GPU workloads

C Avalos Baddouh, M Khairy, RN Green… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Simulating all threads in a scaled GPU workload results in prohibitive simulation cost. Cycle-
level simulation is orders of magnitude slower than native silicon, the only solution is to …

Photon: A fine-grained sampled simulation methodology for GPU workloads

C Liu, Y Sun, TE Carlson - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org
GPUs, due to their massively-parallel computing architectures, provide high performance for
data-parallel applications. However, existing GPU simulators are too slow to enable …

GPUCloudSim: an extension of CloudSim for modeling and simulation of GPUs in cloud data centers

A Siavashi, M Momtazpour - The Journal of Supercomputing, 2019 - Springer
Recent years have witnessed an increasing growth in the usage of GPUs in cloud data
centers. It is known that conventional virtualization techniques are not directly applicable to …

A hybrid framework for fast and accurate GPU performance estimation through source-level analysis and trace-based simulation

X Wang, K Huang, A Knoll… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
This paper proposes a hybrid framework for fast and accurate performance estimation of
OpenCL kernels running on GPUs. The kernel execution flow is statically analyzed and …

Sieve: Stratified GPU-compute workload sampling

M Naderan-Tahan, H SeyyedAghaei… - … Analysis of Systems …, 2023 - ieeexplore.ieee.org
To exploit the ever increasing compute capabilities offered by GPU hardware, GPU-compute
workloads have evolved from simple computational kernels to large-scale programs with …

GCoM: a detailed GPU core model for accurate analytical modeling of modern GPUs

J Lee, Y Ha, S Lee, J Woo, J Lee, H Jang… - Proceedings of the 49th …, 2022 - dl.acm.org
Analytical models can greatly help computer architects perform orders of magnitude faster
early-stage design space exploration than using cycle-level simulators. To facilitate rapid …

TBPoint: Reducing simulation time for large-scale GPGPU kernels

JC Huang, L Nai, H Kim… - 2014 IEEE 28th …, 2014 - ieeexplore.ieee.org
Architecture simulation for GPGPU kernels can take a significant amount of time, especially
for large-scale GPGPU kernels. This paper presents TBPoint, an infrastructure based on …

GPU performance estimation using software rasterization and machine learning

K O'neal, P Brisk, A Abousamra, Z Waters… - ACM Transactions on …, 2017 - dl.acm.org
This paper introduces a predictive modeling framework to estimate the performance of GPUs
during pre-silicon design. Early-stage performance prediction is useful when simulation …

Efficient performance estimation and work-group size pruning for OpenCl kernels on GPUs

X Wang, X Qian, A Knoll… - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Graphic Processing Units (GPUs) play a vital role in state-of-the-art high-performance
scientific computing realm and research work towards its performance analysis is crucial but …