FX! 32: A profile-directed binary translator A Chernoff, M Herdeg, R Hookway, C Reeve, N Rubin, T Tye, ... IEEE Micro 18 (02), 56-64, 1998 | 363 | 1998 |
Spike: An optimizer for Alpha/NT executables R Cohn, D Goodwin, PG Lowney, N Rubin USENIX Windows NT Workshop, 17-24, 1997 | 111 | 1997 |
Efficient instruction scheduling using finite state automata V Bala, N Rubin International Journal of Parallel Programming 25, 53-82, 1997 | 95 | 1997 |
User transparent mechanism for profile feedback optimization DW Goodwin, RS Cohn, PG Lowney, N Rubin US Patent 6,158,049, 2000 | 93 | 2000 |
Optimizing alpha executables on windows nt with spike RS Cohn, DW Goodwin, PG Lowney, N Rubin Digital Technical Journal 9, 3-20, 1997 | 92 | 1997 |
Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 43 (3S), 528-540, 2015 | 86 | 2015 |
Wiggins/Redstone: An on-line program specializer D Deaver, R Gorton, N Rubin Proceedings of the IEEE Hot Chips XI Conference, 1999 | 68 | 1999 |
Laperm: Locality aware scheduler for dynamic parallelism on gpus J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 44 (3), 583-595, 2016 | 67 | 2016 |
Shared memory multiplexing: A novel way to improve GPGPU throughput Y Yang, P Xiang, M Mantor, N Rubin, H Zhou Proceedings of the 21st international conference on Parallel architectures …, 2012 | 66 | 2012 |
Diesel: DSL for linear algebra and neural net computations on GPUs V Elango, N Rubin, M Ravishankar, H Sandanagobalane, V Grover Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine …, 2018 | 62 | 2018 |
Enabling task-level scheduling on heterogeneous platforms E Sun, D Schaa, R Bagley, N Rubin, D Kaeli Proceedings of the 5th Annual Workshop on General Purpose Processing with …, 2012 | 55 | 2012 |
Exploiting uniform vector instructions for GPGPU performance, energy efficiency, and opportunistic reliability enhancement P Xiang, Y Yang, M Mantor, N Rubin, LR Hsu, H Zhou Proceedings of the 27th international ACM conference on International …, 2013 | 47 | 2013 |
Griffin: Hardware-software support for efficient page migration in multi-gpu systems T Baruah, Y Sun, AT Dinçer, SA Mojumder, JL Abellán, Y Ukidave, ... 2020 IEEE International Symposium on High Performance Computer Architecture …, 2020 | 45 | 2020 |
Analyzing program flow within a many-kernel OpenCL application P Mistry, C Gregg, N Rubin, D Kaeli, K Hazelwood Proceedings of the Fourth Workshop on General Purpose Processing on Graphics …, 2011 | 39 | 2011 |
Prism: Predicting resilience of gpu applications using statistical methods C Kalra, F Previlon, X Li, N Rubin, D Kaeli SC18: International Conference for High Performance Computing, Networking …, 2018 | 35 | 2018 |
Another generalization of resolution N Rubin, MC Harrison Journal of the ACM (JACM) 25 (3), 341-351, 1978 | 33 | 1978 |
Armorall: Compiler-based resilience targeting gpu applications C Kalra, F Previlon, N Rubin, D Kaeli ACM Transactions on Architecture and Code Optimization (TACO) 17 (2), 1-24, 2020 | 30 | 2020 |
Method and system for workitem synchronization LW Howes, BR Gaster, MC Houston, M Mantor, M Leather, N Rubin, ... US Patent 8,607,247, 2013 | 28 | 2013 |
Characterizing scalar opportunities in GPGPU applications Z Chen, D Kaeli, N Rubin 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 26 | 2013 |
A new method for gpu based irregular reductions and its application to k-means clustering B Dhanasekaran, N Rubin Proceedings of the fourth workshop on general purpose processing on graphics …, 2011 | 24 | 2011 |