Characterization and analysis of dynamic parallelism in unstructured GPU applications J Wang, S Yalamanchili 2014 IEEE International Symposium on Workload Characterization (IISWC), 51-60, 2014 | 110 | 2014 |
Optimizing data warehousing applications for GPUs using kernel fusion/fission H Wu, G Diamos, J Wang, S Cadambi, S Yalamanchili, S Chakradhar Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW …, 2012 | 87 | 2012 |
Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on GPUs J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 43 (3S), 528-540, 2015 | 86 | 2015 |
LaPerm: Locality aware scheduler for dynamic parallelism on GPUs J Wang, N Rubin, A Sidelnik, S Yalamanchili ACM SIGARCH Computer Architecture News 44 (3), 583-595, 2016 | 67 | 2016 |
Efficient relational algebra algorithms and data structures for GPU GF Diamos, H Wu, A Lele, J Wang Georgia Institute of Technology, 2012 | 46 | 2012 |
Characterization and transformation of unstructured control flow in bulk synchronous GPU applications H Wu, G Diamos, J Wang, S Li, S Yalamanchili The International Journal of High Performance Computing Applications 26 (2 …, 2012 | 45 | 2012 |
Relational algorithms for multi-bulk-synchronous processors G Diamos, H Wu, J Wang, A Lele, S Yalamanchili ACM SIGPLAN Notices 48 (8), 301-302, 2013 | 30 | 2013 |
Accelerating simulation of agent-based models on heterogeneous architectures J Wang, N Rubin, H Wu, S Yalamanchili Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 19 | 2013 |
ParallelJS: An execution framework for JavaScript on heterogeneous systems J Wang, N Rubin, S Yalamanchili Proceedings of Workshop on General Purpose Processing Using GPUs, 72-80, 2014 | 15 | 2014 |
Next-generation consumer audio application specific embedded processor J Kong, P Liu, X Chen, J Wang, X Pan, J Wang, H Xiao, Z Wei, R Ying 2010 IEEE 8th Symposium on Application Specific Processors (SASP), 1-7, 2010 | 10 | 2010 |
General-purpose join algorithms for large graph triangle listing on heterogeneous systems D Zinn, H Wu, J Wang, M Aref, S Yalamanchili Proceedings of the 9th Annual Workshop on General Purpose Processing Using …, 2016 | 8 | 2016 |
Acceleration and optimization of dynamic parallelism for irregular applications on GPUs J Wang Georgia Institute of Technology, 2016 | 4 | 2016 |
Split table extension: A low complexity LVQ extension scheme in low bitrate audio coding J Wang, P Liu, J Kong, R Ying IEEE Signal Processing Letters 17 (1), 59-62, 2009 | 3 | 2009 |
Exploring dynamic parallelism for irregular applications on GPUs J Wang, N Rubin, A Sidelnik, S Yalamanchili Vertex 1, 3, 0 | 1 | |