Access pattern-aware cache management for improving data utilization in GPU G Koo, Y Oh, WW Ro, M Annavaram Proceedings of the 44th annual international symposium on computer …, 2017 | 80 | 2017 |
APRES: Improving cache efficiency by exploiting load characteristics on GPUs Y Oh, K Kim, MK Yoon, JH Park, Y Park, WW Ro, M Annavaram Proceedings of the 43rd International Symposium on Computer Architecture …, 2016 | 43 | 2016 |
Rebooting Virtual Memory with Midgard S Gupta, A Bhattacharyya, Y Oh, A Bhattacharjee, B Falsafi, M Payer Proceedings of the 48th International Symposium on Computer Architecture (ISCA), 2021 | 23 | 2021 |
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores H Kim, S Ahn, Y Oh, B Kim, WW Ro, WJ Song 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), 2020 | 21 | 2020 |
FineReg: Fine-grained register file management for augmenting GPU throughput Y Oh, MK Yoon, WJ Song, WW Ro 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018 | 20 | 2018 |
Scale-out Systolic Arrays AC Yüzügüler, C Sönmez, M Drumond, Y Oh, B Falsafi, P Frossard ACM Transactions on Architecture and Code Optimization, 2023 | 12 | 2023 |
Linebacker: Preserving victim cache lines in idle register files of GPUs Y Oh, G Koo, M Annavaram, WW Ro Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 12 | 2019 |
Draw: investigating benefits of adaptive fetch group size on gpu MK Yoon, Y Oh, S Lee, SH Kim, D Kim, WW Ro 2015 IEEE International Symposium on Performance Analysis of Systems and …, 2015 | 12 | 2015 |
Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs Y Oh, K Kim, MK Yoon, JH Park, Y Park, M Annavaram, WW Ro IEEE Transactions on Computers 68 (4), 609-616, 2019 | 10 | 2019 |
GPU-friendly parallel genome matching with tiled access and reduced state transition table Y Oh, D Oh, WW Ro International Journal of Parallel Programming 41, 526-551, 2013 | 9 | 2013 |
Hardware implementation of a tessellation accelerator for the OpenVG standard SH Kim, Y Oh, K Park, WW Ro ieice electronics express 7 (6), 440-446, 2010 | 7 | 2010 |
WASP: Selective data prefetching with monitoring runtime warp progress on GPUs Y Oh, MK Yoon, JH Park, Y Park, WW Ro IEEE Transactions on Computers 67 (9), 1366-1373, 2018 | 6 | 2018 |
Snakebyte: A tlb design with adaptive and recursive page merging in gpus J Lee, JM Lee, Y Oh, WJ Song, WW Ro 2023 IEEE International Symposium on High-Performance Computer Architecture …, 2023 | 5 | 2023 |
CASH-RF: A Compiler-Assisted Hierarchical Register File in GPUs Y Oh, I Jeong, WW Ro, MK Yoon IEEE Embedded Systems Letters, 2022 | 5 | 2022 |
Dynamic resizing on active warps scheduler to hide operation stalls on GPUs MK Yoon, Y Oh, SH Kim, S Lee, D Kim, WW Ro IEEE Transactions on Parallel and Distributed Systems 28 (11), 3142-3156, 2017 | 4 | 2017 |
R2D2: Removing ReDunDancy Utilizing Linearity of Address Generation in GPUs D Ha, Y Oh, WW Ro Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 3 | 2023 |
AstriFlash: A Flash-Based System for Online Services S Gupta, Y Oh, L Yan, MJ Sutherland, A Bhattacharjee, B Falsafi, P Hsu The 29th IEEE International Symposium on High-Performance Computer …, 2023 | 3 | 2023 |
Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training SB Harma, A Chakraborty, B Falsafi, M Jaggi, Y Oh arXiv preprint arXiv:2211.10737, 2022 | 3 | 2022 |
Central processing unit, GPU simulation method thereof, and computing system including the same WW Ro, K Park, YH Oh, SP Lee, M Kim US Patent 9,378,533, 2016 | 3 | 2016 |
Effective Interplay between Sparsity and Quantization: From Theory to Practice SB Harma, A Chakraborty, E Kostenok, D Mishin, D Ha, B Falsafi, M Jaggi, ... arXiv preprint arXiv:2405.20935, 2024 | 2 | 2024 |