Lost in abstraction: Pitfalls of analyzing GPUs at the intermediate language level A Gutierrez, BM Beckmann, A Dutu, J Gross, M LeBeane, J Kalamatianos, ... 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 83 | 2018 |
Data partitioning strategies for graph workloads on heterogeneous clusters M LeBeane, S Song, R Panda, JH Ryoo, LK John Proceedings of the International Conference for High Performance Computing …, 2015 | 56 | 2015 |
Neighborhood-aware address translation for irregular GPU applications S Shin, M LeBeane, Y Solihin, A Basu 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 32 | 2018 |
Watt watcher: fine-grained power estimation for emerging workloads M LeBeane, JH Ryoo, R Panda, LK John 2015 27th International Symposium on Computer Architecture and High …, 2015 | 26 | 2015 |
GPU triggered networking for intra-kernel communications M LeBeane, K Hamidouche, B Benton, M Breternitz, SK Reinhardt, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 25 | 2017 |
Proxy-guided load balancing of graph processing workloads on heterogeneous clusters S Song, M Li, X Zheng, M LeBeane, JH Ryoo, R Panda, A Gerstlauer, ... 2016 45th International Conference on Parallel Processing (ICPP), 77-86, 2016 | 25 | 2016 |
Performance characterization of modern databases on out-of-order cpus R Panda, C Erb, M Lebeane, JH Ryoo, LK John 2015 27th International Symposium on Computer Architecture and High …, 2015 | 24 | 2015 |
Gpgpu benchmark suites: How well do they sample the performance spectrum? JH Ryoo, SJ Quirem, M Lebeane, R Panda, S Song, LK John 2015 44th International Conference on Parallel Processing, 320-329, 2015 | 18 | 2015 |
GPU initiated OpenSHMEM: correct and efficient intra-kernel networking for dGPUs K Hamidouche, M LeBeane Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020 | 16 | 2020 |
Extended task queuing: Active messages for heterogeneous systems M LeBeane, B Potter, A Pan, A Dutu, V Agarwala, W Lee, D Majeti, ... SC'16: Proceedings of the International Conference for High Performance …, 2016 | 16 | 2016 |
GPU remote communication with triggered operations MW LeBeane, SK Reinhardt US Patent 10,936,533, 2021 | 14 | 2021 |
Optimizing GPU cache policies for MI workloads J Alsop, MD Sinclair, S Bharadwaj, A Dutu, A Gutierrez, O Kayiran, ... 2019 IEEE International Symposium on Workload Characterization (IISWC), 243-248, 2019 | 11 | 2019 |
ComP-net: Command processor networking for efficient intra-kernel communications on GPUs M LeBeane, K Hamidouche, B Benton, M Breternitz, SK Reinhardt, ... Proceedings of the 27th International Conference on Parallel Architectures …, 2018 | 8 | 2018 |
Genesys: Automatically generating representative training sets for predictive benchmarking R Panda, X Zheng, S Song, JH Ryoo, M LeBeane, A Gerstlauer, LK John 2016 International Conference on Embedded Computer Systems: Architectures …, 2016 | 8 | 2016 |
Increasing gpu translation reach by leveraging under-utilized on-chip resources JB Kotra, M LeBeane, MT Kandemir, GH Loh MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 7 | 2021 |
Efficient memory-semantic networking using scoped memory models MW LeBeane, K Hamidouche, HS Thangirala, BK Potter US Patent 11,714,559, 2023 | 4 | 2023 |
Network-related performance for gpus MW LeBeane, K Hamidouche, BM Beckmann US Patent App. 16/049,216, 2020 | 2 | 2020 |
GPU networking using an integrated command processor MW LeBeane, K Hamidouche, WB Benton US Patent 11,544,121, 2023 | 1 | 2023 |
Network packet templating for GPU-initiated communication K Hamidouche, MW LeBeane, WB Benton US Patent 10,740,163, 2020 | 1 | 2020 |
Optimizing communication for clusters of GPUs MW LeBeane | 1 | 2018 |