MGPUSim: enabling multi-GPU performance modeling and optimization Y Sun, T Baruah, SA Mojumder, S Dong, X Gong, S Treadway, Y Bao, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 103* | 2019 |
Stonne: Enabling cycle-level microarchitectural simulation for dnn inference accelerators F Muñoz-Martínez, JL Abellán, ME Acacio, T Krishna 2021 IEEE International Symposium on Workload Characterization (IISWC), 201-213, 2021 | 69* | 2021 |
Thermal management of manycore systems with silicon-photonic networks T Zhang, JL Abellán, A Joshi, AK Coskun 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE), 1-6, 2014 | 67 | 2014 |
Profiling dnn workloads on a volta-based dgx-1 system SA Mojumder, MS Louis, Y Sun, AK Ziabari, JL Abellán, J Kim, D Kaeli, ... 2018 IEEE International Symposium on Workload Characterization (IISWC), 122-133, 2018 | 49 | 2018 |
Asymmetric NoC architectures for GPU systems AK Ziabari, JL Abellán, Y Ma, A Joshi, D Kaeli Proceedings of the 9th International Symposium on Networks-on-Chip, 1-8, 2015 | 49 | 2015 |
Glocks: Efficient support for highly-contended locks in many-core cmps JL Abell, J Fern, ME Acacio 2011 IEEE International Parallel & Distributed Processing Symposium, 893-905, 2011 | 47 | 2011 |
Griffin: Hardware-software support for efficient page migration in multi-gpu systems T Baruah, Y Sun, AT Dinçer, SA Mojumder, JL Abellán, Y Ukidave, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020 | 44 | 2020 |
Leveraging silicon-photonic noc for designing scalable gpus AKK Ziabari, JL Abellán, R Ubal, C Chen, A Joshi, D Kaeli Proceedings of the 29th ACM on International Conference on Supercomputing …, 2015 | 43 | 2015 |
Adaptive tuning of photonic devices in a photonic NoC through dynamic workload allocation JL Abellán, AK Coskun, A Gu, W Jin, A Joshi, AB Kahng, J Klamkin, ... IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2016 | 38 | 2016 |
Efficient hardware barrier synchronization in many-core cmps JL Abellán, J Fernández, ME Acacio IEEE Transactions on Parallel and Distributed Systems 23 (8), 1453-1466, 2011 | 36 | 2011 |
UMH: A hardware-based unified memory hierarchy for systems with multiple discrete GPUs AK Ziabari, Y Sun, Y Ma, D Schaa, JL Abellán, R Ubal, J Kim, A Joshi, ... ACM Transactions on Architecture and Code Optimization (TACO) 13 (4), 1-25, 2016 | 35 | 2016 |
Managing laser power in silicon-photonic NoC through cache and NoC reconfiguration C Chen, JL Abellán, A Joshi IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2015 | 34 | 2015 |
Gnnmark: A benchmark suite to characterize graph neural network training on gpus T Baruah, K Shivdikar, S Dong, Y Sun, SA Mojumder, K Jung, JL Abellán, ... 2021 IEEE International Symposium on Performance Analysis of Systems and …, 2021 | 33 | 2021 |
Understanding the design-space of sparse/dense multiphase GNN dataflows on spatial accelerators R Garg, E Qin, F Muñoz-Matrínez, R Guirado, A Jain, S Abadal, ... 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 31* | 2022 |
High-throughput ant colony optimization on graphics processing units JM Cecilia, A Llanes, JL Abellan, J Gomez-Luna, LW Chang, WMW Hwu Journal of Parallel and Distributed Computing 113, 261-274, 2018 | 29 | 2018 |
A g-line-based network for fast and efficient barrier synchronization in many-core cmps JL Abellán, J Fernández, ME Acacio 2010 39th International Conference on Parallel Processing, 267-276, 2010 | 25 | 2010 |
Efficient and scalable barrier synchronization for many-core cmps JL Abellán, J Fernández, ME Acacio Proceedings of the 7th ACM international conference on Computing frontiers …, 2010 | 24* | 2010 |
TAP-2.5 D: A thermally-aware chiplet placement methodology for 2.5 D systems Y Ma, L Delshadtehrani, C Demirkiran, JL Abellan, A Joshi 2021 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2021 | 23 | 2021 |
Flexagon: A multi-dataflow sparse-sparse matrix multiplication accelerator for efficient dnn processing F Muñoz-Martínez, R Garg, M Pellauer, JL Abellán, ME Acacio, T Krishna Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 20 | 2023 |
Accelerating polynomial multiplication for homomorphic encryption on GPUs K Shivdikar, G Jonatan, E Mora, N Livesay, R Agrawal, A Joshi, ... 2022 IEEE International Symposium on Secure and Private Execution …, 2022 | 20 | 2022 |