Speculative reconvergence for improved SIMT efficiency S Damani, DR Johnson, M Stephenson, SW Keckler, E Yan, M McKeown, ... Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020 | 13 | 2020 |
Vegeta: Vertically-integrated extensions for sparse/dense gemm tile acceleration on cpus G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ... 2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023 | 12 | 2023 |
Gpu subwarp interleaving S Damani, M Stephenson, R Rangan, D Johnson, R Kulkami, SW Keckler 2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022 | 7 | 2022 |
CuCatch: A Debugging Tool for Efficiently Catching Memory Safety Violations in CUDA Applications M Tarek Ibn Ziad, S Damani, A Jaleel, SW Keckler, M Stephenson Proceedings of the ACM on Programming Languages 7 (PLDI), 124-147, 2023 | 4 | 2023 |
Common subexpression convergence: A new code optimization for simt processors S Damani, V Sarkar Languages and Compilers for Parallel Computing: 32nd International Workshop …, 2021 | 4 | 2021 |
Techniques for divergent thread group execution scheduling S Damani, M Stephenson, R Rangan, DR Johnson, R Kulkarni US Patent 11,934,867, 2024 | | 2024 |
WASP: Exploiting GPU Pipeline Parallelism with Hardware-Accelerated Automatic Warp Specialization NC Crago, S Damani, K Sankaralingam, SW Keckler 2024 IEEE International Symposium on High-Performance Computer Architecture …, 2024 | | 2024 |
Convergence among concurrently executing threads DR Johnson, J Choquette, O Giroux, MP McKeown, M Stephenson, ... US Patent 11,847,508, 2023 | | 2023 |
Software-directed register file sharing S Damani, S Treichler, M Stephenson US Patent App. 17/697,325, 2023 | | 2023 |
Software-directed divergent branch target prioritization S Damani, S Treichler, M Stephenson, DR Johnson US Patent App. 17/568,514, 2023 | | 2023 |
Convergence among concurrently executing threads DR Johnson, J Choquette, O Giroux, MP McKeown, M Stephenson, ... US Patent 11,442,795, 2022 | | 2022 |
OPTIMIZED SCHEDULING AND RESOURCE ALLOCATION FOR THREAD PARALLEL ARCHITECTURES S Damani Georgia Institute of Technology, 2022 | | 2022 |
Memory access scheduling to reduce thread migrations S Damani, P Barua, V Sarkar Proceedings of the 31st ACM SIGPLAN International Conference on Compiler …, 2022 | | 2022 |