A malicious pattern detection engine for embedded security systems in the Internet of Things D Oh, D Kim, WW Ro Sensors 14 (12), 24188-24211, 2014 | 158 | 2014 |
Warped-compression: Enabling power efficient GPUs through register compression S Lee, K Kim, G Koo, H Jeon, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 43 (3S), 502-514, 2015 | 143 | 2015 |
Warped-slicer: Efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming Q Xu, H Jeon, K Kim, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 44 (3), 230-242, 2016 | 136 | 2016 |
Access pattern-aware cache management for improving data utilization in GPU G Koo, Y Oh, WW Ro, M Annavaram Proceedings of the 44th Annual International Symposium on Computer …, 2017 | 80 | 2017 |
Fast CU depth decision for HEVC using neural networks K Kim, WW Ro IEEE Transactions on Circuits and Systems for Video Technology 29 (5), 1462-1473, 2018 | 72 | 2018 |
Virtual thread: Maximizing thread-level parallelism beyond GPU scheduling limit MK Yoon, K Kim, S Lee, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 44 (3), 609-621, 2016 | 65 | 2016 |
Warped-preexecution: A GPU pre-execution approach for improving latency hiding K Kim, S Lee, MK Yoon, G Koo, WW Ro, M Annavaram 2016 IEEE International Symposium on High Performance Computer Architecture …, 2016 | 58 | 2016 |
XSD: Accelerating MapReduce by harnessing the GPU inside an SSD BY Cho, WS Jeong, D Oh, WW Ro | 56 | 2013 |
Efficient peer-to-peer file sharing using network coding in MANET U Lee, JS Park, SH Lee, WW Ro, G Pau, M Gerla Journal of Communications and Networks 10 (4), 422-429, 2008 | 50 | 2008 |
APRES: Improving cache efficiency by exploiting load characteristics on GPUs Y Oh, K Kim, MK Yoon, JH Park, Y Park, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 44 (3), 191-203, 2016 | 43 | 2016 |
Boosting CUDA applications with CPU–GPU hybrid computing C Lee, WW Ro, JL Gaudiot International Journal of Parallel Programming 42 (2), 384-404, 2014 | 40 | 2014 |
Parallel GPU architecture simulation framework exploiting work allocation unit parallelism S Lee, WW Ro 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 33 | 2013 |
On improving parallelized network coding with dynamic partitioning K Park, JS Park, WW Ro IEEE Transactions on Parallel and Distributed Systems 21 (11), 1547-1560, 2010 | 32 | 2010 |
SPACE: Locality-aware processing in heterogeneous memory for personalized recommendations H Kal, S Lee, G Ko, WW Ro 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 28 | 2021 |
MGMR: Multi-GPU based MapReduce Y Chen, Z Qiao, H Jiang, KC Li, WW Ro Grid and Pervasive Computing: 8th International Conference, GPC 2013 and …, 2013 | 27 | 2013 |
Cooperative heterogeneous computing for parallel processing on CPU/GPU hybrids C Lee, WW Ro, JL Gaudiot 2012 16th Workshop on Interaction between Compilers and Computer …, 2012 | 26 | 2012 |
Improving energy efficiency of GPUs through data compression and compressed execution S Lee, K Kim, G Koo, H Jeon, M Annavaram, WW Ro IEEE Transactions on Computers 66 (5), 834-847, 2016 | 25 | 2016 |
Accelerated network coding with dynamic stream decomposition on graphics processing unit S Lee, WW Ro The Computer Journal 55 (1), 21-34, 2012 | 24 | 2012 |
WIR: Warp instruction reuse to minimize repeated computations in GPUs K Kim, WW Ro 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 23 | 2018 |
Delay analysis of car-to-car reliable data delivery strategies based on data mulling with network coding JS Park, U Lee, SY Oh, M Gerla, DS Lun, WW Ro, J Park IEICE Transactions on Information and Systems 91 (10), 2524-2527, 2008 | 23 | 2008 |