Scalable Bayesian optimization using deep neural networks J Snoek, O Rippel, K Swersky, R Kiros, N Satish, N Sundaram, M Patwary, ... International conference on machine learning, 2171-2180, 2015 | 1259 | 2015 |
Deep learning recommendation model for personalization and recommendation systems M Naumov, D Mudigere, HJM Shi, J Huang, N Sundaraman, J Park, ... arXiv preprint arXiv:1906.00091, 2019 | 672 | 2019 |
Dense point trajectories by GPU-accelerated large displacement optical flow N Sundaram, T Brox, K Keutzer European conference on computer vision, 438-451, 2010 | 609 | 2010 |
Fast support vector machine training and classification on graphics processors B Catanzaro, N Sundaram, K Keutzer Proceedings of the 25th international conference on Machine learning, 104-111, 2008 | 540 | 2008 |
Graphicionado: A high-performance and energy-efficient accelerator for graph analytics TJ Ham, L Wu, N Sundaram, N Satish, M Martonosi 2016 49th annual IEEE/ACM international symposium on microarchitecture …, 2016 | 415 | 2016 |
GraphMat: High performance graph analytics made productive N Sundaram, NR Satish, MMA Patwary, SR Dulloor, SG Vadlamudi, ... arXiv preprint arXiv:1503.07241, 2015 | 392 | 2015 |
Data tiering in heterogeneous memory systems SR Dulloor, A Roy, Z Zhao, N Sundaram, N Satish, R Sankaran, ... Proceedings of the Eleventh European Conference on Computer Systems, 1-16, 2016 | 262 | 2016 |
Navigating the maze of graph analytics frameworks using massive graph datasets N Satish, N Sundaram, MMA Patwary, J Seo, J Park, MA Hassaan, ... Proceedings of the 2014 ACM SIGMOD international conference on Management of …, 2014 | 246 | 2014 |
Efficient, high-quality image contour detection B Catanzaro, BY Su, N Sundaram, Y Lee, M Murphy, K Keutzer 2009 IEEE 12th International Conference on Computer Vision, 2381-2388, 2009 | 183 | 2009 |
Streaming similarity search over one billion tweets using parallel locality-sensitive hashing N Sundaram, A Turmukhametova, N Satish, T Mostak, P Indyk, S Madden, ... Proceedings of the VLDB Endowment 6 (14), 1930-1941, 2013 | 169 | 2013 |
LDBC Graphalytics: A benchmark for large-scale graph analysis on parallel and distributed platforms A Iosup, T Hegeman, WL Ngai, S Heldens, A Prat-Pérez, T Manhardto, ... Proceedings of the VLDB Endowment 9 (13), 1317-1328, 2016 | 154 | 2016 |
A map reduce framework for programming graphics processors B Catanzaro, N Sundaram, K Keutzer Workshop on Software Tools for MultiCore Systems, 2008 | 114 | 2008 |
Deep learning at 15PF: supervised and semi-supervised classification for scientific data T Kurth, J Zhang, N Satish, E Racah, I Mitliagkas, MMA Patwary, T Malas, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 95 | 2017 |
Sparsifying synchronization for high-performance shared-memory sparse triangular solver J Park, M Smelyanskiy, N Sundaram, P Dubey Supercomputing: 29th International Conference, ISC 2014, Leipzig, Germany …, 2014 | 93 | 2014 |
GraphIn: An online high performance incremental graph processing framework D Sengupta, N Sundaram, X Zhu, TL Willke, J Young, M Wolf, K Schwan Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016 | 87 | 2016 |
Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms N Sundaram, J Park, MJ Anderson, SG Vadlamudi, D Das, SG Pudov, ... High Performance Computing: 30th International Conference, ISC High …, 2015 | 79 | 2015 |
Bridging the gap between HPC and big data frameworks M Anderson, S Smith, N Sundaram, M Capotă, Z Zhao, S Dulloor, ... Proceedings of the VLDB Endowment 10 (8), 901-912, 2017 | 77 | 2017 |
BD-CATS: big data clustering at trillion particle scale MMA Patwary, S Byna, NR Satish, N Sundaram, Z Lukić, V Roytershteyn, ... Proceedings of the International Conference for High Performance Computing …, 2015 | 74 | 2015 |
GraphPad: Optimized graph primitives for parallel and distributed platforms MJ Anderson, N Sundaram, N Satish, MMA Patwary, TL Willke, P Dubey 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016 | 72 | 2016 |
A framework for efficient and scalable execution of domain-specific templates on GPUs N Sundaram, A Raghunathan, ST Chakradhar 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009 | 59 | 2009 |