Parallel Programming with Polaris W Blume, R Doallo, R Eigenmann, J Grout, J Hoeflinger, T Lawrence, ... IEEE Computer 29 (12), 78-82, 1996 | 494 | 1996 |
SnuCL: an OpenCL framework for heterogeneous CPU/GPU clusters J Kim, S Seo, J Lee, J Nah, G Jo, J Lee Proceedings of the 26th ACM international conference on Supercomputing, 341-352, 2012 | 280 | 2012 |
Performance characterization of the NAS Parallel Benchmarks in OpenCL S Seo, G Jo, J Lee 2011 IEEE international symposium on workload characterization (IISWC), 137-148, 2011 | 268 | 2011 |
STeP: The Stanford Temporal Prover Z Manna, N Bjørner, A Browne, E Chang, M Colón, L de Alfaro, ... TAPSOFT'95: Theory and Practice of Software Development, 793-794, 1995 | 249 | 1995 |
Using a user-level memory thread for correlation prefetching Y Solihin, J Lee, J Torrellas Proceedings of the 29th Annual International Symposium on Computer …, 2002 | 233 | 2002 |
Achieving a single compute device image in OpenCL for multiple GPUs J Kim, H Kim, JH Lee, J Lee Proceedings of the 16th ACM symposium on Principles and practice of parallel …, 2011 | 190 | 2011 |
Using prime numbers for cache indexing to eliminate conflict misses M Kharbutli, K Irwin, Y Solihin, J Lee Proceedings of the 10th International Symposium on High Performance Computer …, 2004 | 154 | 2004 |
Performance analysis of CNN frameworks for GPUs H Kim, H Nam, W Jung, J Lee 2017 IEEE International Symposium on Performance Analysis of Systems and …, 2017 | 147 | 2017 |
Hydra: A block-mapped parallel flash memory solid-state disk architecture YJ Seong, EH Nam, JH Yoon, H Kim, J Choi, S Lee, YH Bae, J Lee, Y Cho, ... IEEE Transactions on Computers 59 (7), 905-921, 2010 | 133 | 2010 |
Concurrent static single assignment form and constant propagation for explicitly parallel programs J Lee, SP Midkiff, DA Padua International Workshop on Languages and Compilers for Parallel Computing …, 1997 | 133 | 1997 |
Compiler techniques for high performance sequentially consistent java programs Z Sura, X Fang, CL Wong, SP Midkiff, J Lee, D Padua Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of …, 2005 | 118 | 2005 |
Basic compiler algorithms for parallel programs J Lee, DA Padua, SP Midkiff ACM SIGPLAN Notices 34 (8), 1-12, 1999 | 115 | 1999 |
Hiding Relaxed Memory Consistency with a Compiler J Lee, D Padua Transactions on Computers 50 (8), 824-833, 2001 | 112 | 2001 |
Hiding relaxed memory consistency with compilers J Lee, DA Padua Proceedings of the 2000 International Conference on Parallel Architectures …, 2000 | 112 | 2000 |
Compiler-assisted demand paging for embedded systems with flash memory C Park, J Lim, K Kwon, J Lee, SL Min Proceedings of the 4th ACM international conference on Embedded software …, 2004 | 100 | 2004 |
Automatic fence insertion for shared memory multiprocessing X Fang, J Lee, SP Midkiff Proceedings of the 17th annual international conference on Supercomputing …, 2003 | 99 | 2003 |
An OpenCL framework for heterogeneous multicores with local memory J Lee, J Kim, S Seo, S Kim, J Park, H Kim, TT Dao, Y Cho, SJ Seo, SH Lee, ... Proceedings of the 19th international conference on Parallel architectures …, 2010 | 92 | 2010 |
Adaptive execution techniques for SMT multiprocessor architectures C Jung, D Lim, J Lee, SY Han Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of …, 2005 | 92 | 2005 |
Fast and space-efficient virtual machine checkpointing E Park, B Egger, J Lee ACM SIGPLAN Notices 46 (7), 75-86, 2011 | 91 | 2011 |
Scratchpad memory management for portable systems with a memory management unit B Egger, J Lee, H Shin Proceedings of the 6th ACM & IEEE International conference on Embedded …, 2006 | 88 | 2006 |