Improving main memory hash joins on Intel Xeon Phi processors S Jha, B He, M Lu, X Cheng, HP Huynh | 117 | 2015 |
Efficient GPU spatial-temporal multitasking Y Liang, HP Huynh, K Rupnow, RSM Goh, D Chen IEEE Transactions on Parallel and Distributed Systems 26 (3), 748-760, 2014 | 104 | 2014 |
Optimizing the mapreduce framework on intel xeon phi coprocessor M Lu, L Zhang, HP Huynh, Z Ong, Y Liang, B He, RSM Goh, R Huynh 2013 IEEE International Conference on Big Data, 125-130, 2013 | 81 | 2013 |
Optimizing and auto-tuning scale-free sparse matrix-vector multiplication on Intel Xeon Phi WT Tang, R Zhao, M Lu, Y Liang, HP Huynh, X Li, RSM Goh Code Generation and Optimization (CGO), 2015 IEEE/ACM International …, 2015 | 80 | 2015 |
Improving GPGPU energy-efficiency through concurrent kernel execution and DVFS Q Jiao, M Lu, HP Huynh, T Mitra 2015 IEEE/ACM International Symposium on Code Generation and Optimization …, 2015 | 78 | 2015 |
Scalable framework for mapping streaming applications onto multi-GPU systems HP Huynh, A Hagiescu, WF Wong, RSM Goh Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012 | 70 | 2012 |
Hierarchical parallel algorithm for modularity-based community detection using GPUs CY Cheong, HP Huynh, D Lo, RSM Goh Euro-Par 2013 Parallel Processing: 19th International Conference, Aachen …, 2013 | 49 | 2013 |
Mrphi: An optimized mapreduce framework on intel xeon phi coprocessors M Lu, Y Liang, HP Huynh, Z Ong, B He, RSM Goh IEEE Transactions on Parallel and Distributed Systems 26 (11), 3066-3078, 2014 | 45 | 2014 |
Automated architecture-aware mapping of streaming applications onto GPUs A Hagiescu, HP Huynh, WF Wong, RSM Goh 2011 IEEE International Parallel & Distributed Processing Symposium, 467-478, 2011 | 43 | 2011 |
An efficient framework for dynamic reconfiguration of instruction-set customization HP Huynh, JE Sim, T Mitra Proceedings of the 2007 international conference on Compilers, architecture …, 2007 | 40 | 2007 |
Exploiting sparsity to accelerate fully connected layers of cnn-based applications on mobile socs X Xie, D Du, Q Li, Y Liang, WT Tang, ZL Ong, M Lu, HP Huynh, RSM Goh ACM Transactions on Embedded Computing Systems (TECS) 17 (2), 1-25, 2017 | 26 | 2017 |
Runtime Adaptive Extensible Embedded Processors—A Survey HP Huynh, T Mitra International Workshop on Embedded Computer Systems, 215-225, 2009 | 25 | 2009 |
Scale-free sparse matrix-vector multiplication on many-core architectures Y Liang, WT Tang, R Zhao, M Lu, HP Huynh, RSM Goh IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2017 | 19 | 2017 |
Mapping streaming applications onto GPU systems HP Huynh, A Hagiescu, OZ Liang, WF Wong, RSM Goh IEEE Transactions on Parallel and Distributed Systems 25 (9), 2374-2385, 2013 | 17 | 2013 |
Efficient custom instructions generation for system-level design HP Huynh, Y Liang, T Mitra 2010 International Conference on Field-Programmable Technology, 445-448, 2010 | 16 | 2010 |
Evaluating design trade-offs in customizable processors UD Bordoloi, HP Huynh, S Chakraborty, T Mitra Proceedings of the 46th Annual Design Automation Conference, 244-249, 2009 | 15 | 2009 |
Efficient query processing on many-core architectures: A case study with intel xeon phi processor X Cheng, B He, M Lu, CT Lau, HP Huynh, RSM Goh Proceedings of the 2016 International Conference on Management of Data, 2081 …, 2016 | 14 | 2016 |
Runtime reconfiguration of custom instructions for real-time embedded systems HP Huynh, T Mitra 2009 Design, Automation & Test in Europe Conference & Exhibition, 1536-1541, 2009 | 14 | 2009 |
Instruction-set customization for real-time embedded systems HP Huynh, T Mitra 2007 Design, Automation & Test in Europe Conference & Exhibition, 1-6, 2007 | 11 | 2007 |
Design space exploration of instruction set customizable MPSoCs for multimedia applications UD Bordoloi, HP Huynh, T Mitra, S Chakraborty 2010 International Conference on Embedded Computer Systems: Architectures …, 2010 | 9 | 2010 |