OWL: cooperative thread array aware scheduling techniques for improving GPGPU performance A Jog, O Kayiran, N Chidambaram Nachiappan, AK Mishra, MT Kandemir, ... ACM SIGPLAN Notices 48 (4), 395-406, 2013 | 369 | 2013 |
Neither More Nor Less: Optimizing Thread-Level Parallelism for GPGPUs O Kayiran, A Jog, MT Kandemir, CR Das In Proceedings of PACT-2013, 22nd International Conference on Parallel …, 2013 | 332 | 2013 |
Orchestrated scheduling and prefetching for GPGPUs A Jog, O Kayiran, AK Mishra, MT Kandemir, O Mutlu, R Iyer, CR Das Proceedings of the 40th Annual International Symposium on Computer …, 2013 | 247 | 2013 |
Scheduling techniques for GPU architectures with processing-in-memory capabilities A Pattnaik, X Tang, A Jog, O Kayiran, AK Mishra, MT Kandemir, O Mutlu, ... Proceedings of the 2016 International Conference on Parallel Architectures …, 2016 | 227 | 2016 |
Managing GPU Concurrency in Heterogeneous Architectures O Kayıran, NC Nachiappan, A Jog, R Ausavarungnirun, MT Kandemir, ... Proceedings of the 47th Annual IEEE/ACM International Symposium on …, 2014 | 173 | 2014 |
Modular routing design for chiplet-based systems J Yin, Z Lin, O Kayiran, M Poremba, MSB Altaf, NE Jerger, GH Loh 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 116 | 2018 |
Anatomy of GPU memory system for multi-application execution A Jog, O Kayiran, T Kesten, A Pattnaik, E Bolotin, N Chatterjee, ... Proceedings of the 2015 International Symposium on Memory Systems, 223-234, 2015 | 110 | 2015 |
Design and Analysis of an APU for Exascale Computing T Vijayaraghavan, Y Eckert, GH Loh, MJ Schulte, M Ignatowski, ... 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017 | 95 | 2017 |
Exploiting inter-warp heterogeneity to improve GPGPU performance R Ausavarungnirun, S Ghose, O Kayiran, GH Loh, CR Das, MT Kandemir, ... 2015 International Conference on Parallel Architecture and Compilation (PACT …, 2015 | 94 | 2015 |
Lost in abstraction: Pitfalls of analyzing GPUs at the intermediate language level A Gutierrez, BM Beckmann, A Dutu, J Gross, M LeBeane, J Kalamatianos, ... 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 83 | 2018 |
OSCAR: Orchestrating STT-RAM cache traffic for heterogeneous CPU-GPU architectures J Zhan, O Kayıran, GH Loh, CR Das, Y Xie 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 63 | 2016 |
Exploiting core criticality for enhanced GPU performance A Jog, O Kayiran, A Pattnaik, MT Kandemir, O Mutlu, R Iyer, CR Das Proceedings of the 2016 ACM SIGMETRICS International Conference on …, 2016 | 62 | 2016 |
Controlled kernel launch for dynamic parallelism in GPUs X Tang, A Pattnaik, H Jiang, O Kayiran, A Jog, S Pai, M Ibrahim, ... 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017 | 61 | 2017 |
Opportunistic computing in GPU architectures A Pattnaik, X Tang, O Kayiran, A Jog, A Mishra, MT Kandemir, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 52 | 2019 |
Efficient and fair multi-programming in GPUs via effective bandwidth management H Wang, F Luo, M Ibrahim, O Kayiran, A Jog 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 50 | 2018 |
μC-States: Fine-grained GPU datapath power management O Kayiran, A Jog, A Pattnaik, R Ausavarungnirun, X Tang, MT Kandemir, ... Proceedings of the 2016 International Conference on Parallel Architectures …, 2016 | 49 | 2016 |
There and back again: Optimizing the interconnect in networks of memory cubes M Poremba, I Akgun, J Yin, O Kayiran, Y Xie, GH Loh ACM SIGARCH Computer Architecture News 45 (2), 678-690, 2017 | 29 | 2017 |
Coda: Enabling co-location of computation and data for multiple GPU systems H Kim, R Hadidi, L Nai, H Kim, N Jayasena, Y Eckert, O Kayiran, G Loh ACM Transactions on Architecture and Code Optimization (TACO) 15 (3), 1-23, 2018 | 27 | 2018 |
Architectural support for efficient large-scale automata processing H Liu, M Ibrahim, O Kayiran, S Pai, A Jog 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 24 | 2018 |
Analyzing and leveraging remote-core bandwidth for enhanced performance in GPUs MA Ibrahim, H Liu, O Kayiran, A Jog 2019 28th International Conference on Parallel Architectures and Compilation …, 2019 | 22 | 2019 |