Cache-conscious wavefront scheduling TG Rogers, M O'Connor, TM Aamodt 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 72-83, 2012 | 550 | 2012 |
Accel-Sim: An extensible simulation framework for validated GPU modeling M Khairy, Z Shen, TM Aamodt, TG Rogers 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 264* | 2020 |
Divergence-aware warp scheduling TG Rogers, M O'Connor, TM Aamodt Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013 | 197 | 2013 |
Characterizing and evaluating a key-value store application on heterogeneous CPU-GPU systems TH Hetherington, TG Rogers, L Hsu, M O'Connor, TM Aamodt 2012 IEEE International Symposium on Performance Analysis of Systems …, 2012 | 148 | 2012 |
General-purpose graphics processor architectures TM Aamodt, WWL Fung, TG Rogers, M Martonosi Morgan & Claypool Publishers, 2018 | 95 | 2018 |
Lost in abstraction: Pitfalls of analyzing GPUs at the intermediate language level A Gutierrez, BM Beckmann, A Dutu, J Gross, M LeBeane, J Kalamatianos, ... 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 89 | 2018 |
Analyzing machine learning workloads using a detailed GPU simulator J Lew, DA Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ... 2019 IEEE international symposium on performance analysis of systems and …, 2019 | 84 | 2019 |
AccelWattch: A power modeling framework for modern GPUs V Kandiah, S Peverelle, M Khairy, J Pan, A Manjunath, TG Rogers, ... MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 78 | 2021 |
GPGPU-Sim 3. x manual TM Aamodt, WWL Fung, I Singh, A El-Shafiey, J Kwa, T Hetherington, ... 2012-08-08)[2013-08-08]. http:∥ gpgpu-sim. org/manual/index. php/GPGPU …, 2012 | 66 | 2012 |
A variable warp size architecture TG Rogers, DR Johnson, M O'Connor, SW Keckler ACM SIGARCH Computer Architecture News 43 (3S), 489-501, 2015 | 62 | 2015 |
Pagoda: Fine-grained gpu resource virtualization for narrow tasks TT Yeh, A Sabne, P Sakdhnagool, R Eigenmann, TG Rogers ACM SIGPLAN Notices 52 (8), 221-234, 2017 | 58 | 2017 |
Locality-centric data and threadblock management for massive GPUs M Khairy, V Nikiforov, D Nellans, TG Rogers 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 28 | 2020 |
Principal kernel analysis: A tractable methodology to simulate scaled GPU workloads C Avalos Baddouh, M Khairy, RN Green, M Payer, TG Rogers MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 23 | 2021 |
A quantitative evaluation of contemporary gpu simulation methodology A Jain, M Khairy, TG Rogers Proceedings of the ACM on Measurement and Analysis of Computing Systems 2 (2 …, 2018 | 18 | 2018 |
Creating SIMD efficient code by transferring register state through common memory TG Rogers, BM Beckmann, JM O'connor US Patent 9,354,892, 2016 | 15 | 2016 |
Deadline-aware offloading for high-throughput accelerators TT Yeh, MD Sinclair, BM Beckmann, TG Rogers 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 14 | 2021 |
Deterministic atomic buffering YH Chou, C Ng, S Cattell, J Intan, MD Sinclair, J Devietti, TG Rogers, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 13 | 2020 |
Dimensionality-aware redundant SIMT instruction elimination TT Yeh, RN Green, TG Rogers Proceedings of the Twenty-Fifth International Conference on Architectural …, 2020 | 12 | 2020 |
A detailed model for contemporary GPU memory systems M Khairy, A Jain, TM Aamodt, TG Rogers 2019 IEEE International Symposium on Performance Analysis of Systems and …, 2019 | 9 | 2019 |
Cache-conscious thread scheduling for massively multithreaded processors TG Rogers, M O'Connor, TM Aamodt IEEE Micro 33 (3), 78-85, 2013 | 9 | 2013 |