Gluon: A Communication-Optimizing Substrate for Distributed Heterogeneous Graph Analytics R Dathathri, G Gill, L Hoang, HV Dang, A Brooks, N Dryden, M Snir, ... PLDI 2018, 2018 | 147 | 2018 |
GPU-based high-performance computing for integrated surface–sub-surface flow modeling PVV Le, P Kumar, AJ Valocchi, HV Dang Environmental Modelling & Software 73 (1364-8152), 1-13, 2015 | 62 | 2015 |
Unicorn: Next generation CPU emulator framework NA Quynh, DH Vu BlackHat USA 476, 2015 | 61 | 2015 |
Designing scientific applications on GPUs R Couturier, (many authors) CRC Press, 2013 | 61 | 2013 |
CUDA-enabled Sparse Matrix–Vector Multiplication on GPUs using atomic operations HV Dang, B Schmidt Parallel Computing 39 (11), 737-750, 2013 | 53 | 2013 |
Towards millions of communicating threads HV Dang, M Snir, W Gropp European MPI group meeting (EuroMPI) 2016, Best paper, 2016 | 48 | 2016 |
Advanced Thread Synchronization for Multithreaded MPI Implementations HV Dang, S Seo, A Amer, P Balaji 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing …, 2017 | 32 | 2017 |
The Sliced COO format for sparse matrix-vector multiplication on CUDA-enabled GPUs HV Dang, B Schmidt Procedia Computer Science 9, 57-66, 2012 | 32 | 2012 |
Gluon-async: A bulk-asynchronous system for distributed and heterogeneous graph analytics R Dathathri, G Gill, L Hoang, V Jatala, K Pingali, VK Nandivada, HV Dang, ... 2019 28th International Conference on Parallel Architectures and Compilation …, 2019 | 31 | 2019 |
A Lightweight Communication Runtime for Distributed Graph Analytics HV Dang, A Brooks, N Dryden, M Snir, R Dathathri, G Gill, L Hoang, ... IPDPS 2018, 2018 | 29 | 2018 |
Scalable clustering by iterative partitioning and point attractor representation J Shao, Q Yang, HV Dang, B Schmidt, S Kramer ACM Transactions on Knowledge Discovery from Data (TKDD) 11 (1), 1-23, 2016 | 24 | 2016 |
Unicorn: The ultimate CPU emulator NA Quynh, HV Dang BlackHat 2015, 2015 | 21 | 2015 |
CUDA-enabled hierarchical Ward clustering of protein structures based on the nearest neighbour chain algorithm HV Dang, B Schmidt, A Hildebrandt, TT Tran, AK Hildebrandt International Journal of High Performance Computing Applications 31 (3), 181-181, 2017 | 20 | 2017 |
Automatic Generation of I/O Kernels for HPC applications B Behzad, HV Dang, F Hariri, W Zhang, M Snir 9th Parallel Data Storage Workshop (PDSW), 2014 | 20 | 2014 |
Wikinetviz: Visualizing friends and adversaries in implicit social networks MT Le, HV Dang, EP Lim, A Datta 2008 IEEE International Conference on Intelligence and Security Informatics …, 2008 | 14 | 2008 |
Iterative sparse matrix-vector multiplication for integer factorization on GPUs B Schmidt, H Aribowo, HV Dang Euro-Par 2011 Parallel Processing: 17th International Conference, Euro-Par …, 2011 | 11 | 2011 |
Capstone engine NA Quynh, TS Di, B Nagy, DH Vu | 10 | 2021 |
PPL: An abstract runtime system for hybrid parallel programming A Brooks, HV Dang, N Dryden, M Snir ESPM2, First International Workshop on Extreme Scale Programming Models and …, 2015 | 7 | 2015 |
Eliminating contention bottlenecks in multithreaded MPI HV Dang, M Snir, W Gropp Parallel Computing 69, 1-23, 2017 | 6 | 2017 |
Iterative sparse matrix–vector multiplication for accelerating the block Wiedemann algorithm over GF(2) on multi‐graphics processing unit systems B Schmidt, H Aribowo, HV Dang Concurrency and Computation: Practice and Experience 25 (4), 586-603, 2013 | 6 | 2013 |