Stash: Have your scratchpad and cache it too R Komuravelli, MD Sinclair, J Alsop, M Kotsifakou, P Srivastava, SV Adve, ... Proceedings of the 42nd Annual International Symposium on Computer …, 2015 | 101 | 2015 |
Efficient GPU synchronization without scopes: Saying no to complex consistency models MD Sinclair, J Alsop, SV Adve Proceedings of the 48th International Symposium on Microarchitecture, 647-659, 2015 | 85 | 2015 |
Spandex: A flexible interface for efficient heterogeneous coherence J Alsop, M Sinclair, S Adve 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 56 | 2018 |
Lazy release consistency for GPUs J Alsop, MS Orr, BM Beckmann, DA Wood 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 54 | 2016 |
Chasing away RAts: Semantics and evaluation for relaxed atomics on heterogeneous systems MD Sinclair, J Alsop, SV Adve Proceedings of the 44th Annual International Symposium on Computer …, 2017 | 51 | 2017 |
HeteroSync: A benchmark suite for fine-grained synchronization on tightly coupled GPUs MD Sinclair, J Alsop, SV Adve 2017 IEEE International Symposium on Workload Characterization (IISWC), 239-249, 2017 | 29 | 2017 |
Inter-kernel reuse-aware thread block scheduling M Huzaifa, J Alsop, A Mahmoud, G Salvador, MD Sinclair, SV Adve ACM Transactions on Architecture and Code Optimization (TACO) 17 (3), 1-27, 2020 | 22 | 2020 |
Optimizing GPU cache policies for MI workloads J Alsop, MD Sinclair, S Bharadwaj, A Dutu, A Gutierrez, O Kayiran, ... 2019 IEEE International Symposium on Workload Characterization (IISWC), 243-248, 2019 | 12 | 2019 |
Specializing coherence, consistency, and push/pull for gpu graph analytics G Salvador, WH Darvin, M Huzaifa, J Alsop, MD Sinclair, SV Adve 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020 | 11 | 2020 |
GSI: A GPU stall inspector to characterize the sources of memory stalls for tightly coupled GPUs J Alsop, MD Sinclair, R Komuravelli, SV Adve 2016 IEEE International Symposium on Performance Analysis of Systems and …, 2016 | 10 | 2016 |
A Research Retrospective on AMD's Exascale Computing Journey GH Loh, MJ Schulte, M Ignatowski, V Adhinarayanan, S Aga, D Aguren, ... Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 6 | 2023 |
Dynamic multi-bank memory command coalescing J Alsop, SD Aga US Patent 11,681,465, 2023 | 3 | 2023 |
Bank-Level Parallelism for Processing in Memory M Islam, SD Aga, JR Alsop, MAAEM Ibrahim, NS Jayasena US Patent App. 17/953,723, 2024 | 1 | 2024 |
System and method for coalesced multicast data transfers over memory interfaces J Alsop, N Jayasena, AGA Shaizeen, A McCrabb US Patent 11,803,311, 2023 | 1 | 2023 |
Dynamically coalescing atomic memory operations for memory-local computing J Alsop, A Dutu, AGA Shaizeen, N Jayasena US Patent 11,726,918, 2023 | 1 | 2023 |
Enforcing data placement requirements via address bit swapping J Alsop, AGA Shaizeen US Patent App. 17/218,994, 2022 | 1 | 2022 |
A case for fine-grain coherence specialization in heterogeneous systems J Alsop, WT Na, MD Sinclair, S Grayson, S Adve ACM Transactions on Architecture and Code Optimization (TACO) 19 (3), 1-26, 2022 | 1 | 2022 |
Memory access commands with near-memory address generation AGA Shaizeen, N Jayasena, J Alsop US Patent 11,216,373, 2022 | 1 | 2022 |
Arbitrating atomic memory operations S Blagodurov, J Alsop, JB Kotra, M Scrbak, G Dasika US Patent 12,019,566, 2024 | | 2024 |
Programmable Data Storage Memory Hierarchy SD Aga, JR Alsop, NS Jayasena US Patent App. 18/068,670, 2024 | | 2024 |