Heterogeneous computing with OpenCL B Gaster, L Howes, DR Kaeli, P Mistry, D Schaa Morgan Kaufmann, 2011 | 591* | 2011 |
A comparison of CPUs, GPUs, FPGAs, and massively parallel processor arrays for random number generation DB Thomas, L Howes, W Luk Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2009 | 254 | 2009 |
Performance comparison of graphics processors to reconfigurable logic: A case study B Cope, PYK Cheung, W Luk, L Howes IEEE Transactions on computers 59 (4), 433-448, 2010 | 153 | 2010 |
Efficient random number generation and application using CUDA L Howes, D Thomas GPU gems 3, 805-830, 2007 | 115 | 2007 |
The OpenCL specification, version 2.0 L Howes, A Munshi Khronos Group, 2015 | 91 | 2015 |
Khronos SYCL for OpenCL: a tutorial R Keryell, R Reyes, L Howes Proceedings of the 3rd International Workshop on OpenCL, 1-1, 2015 | 60 | 2015 |
Design space exploration with a stream compiler O Mencer, DJ Pearce, LW Howes, W Luk Proceedings. 2003 IEEE International Conference on Field-Programmable …, 2003 | 56 | 2003 |
Can GPGPU Programming Be Liberated from the Data-Parallel Bottleneck? BR Gaster, L Howes IEEE Computer 45 (8), 42-52, 2012 | 49 | 2012 |
Deriving efficient data movement from decoupled access/execute specifications LW Howes, A Lokhmotov, AF Donaldson, PHJ Kelly High Performance Embedded Architectures and Compilers: Fourth International …, 2009 | 47 | 2009 |
HRF-Relaxed: Adapting HRF to the complexities of industrial heterogeneous memory models BR Gaster, D Hower, L Howes ACM Transactions on Architecture and Code Optimization (TACO) 12 (1), 1-26, 2015 | 45 | 2015 |
Optimized Context Switching for Long-Running Processes LW Howes, BR Gaster, M Mantor US Patent App. 13/691,066, 2014 | 34 | 2014 |
Comparing FPGAs to graphics accelerators and the PlayStation 2 using a unified source description LW Howes, P Price, O Mencer, O Beckmann, O Pell 2006 International Conference on Field Programmable Logic and Applications, 1-6, 2006 | 31 | 2006 |
Heterogeneous Parallel Primitives Programming Model BR Gaster, LW Howes US Patent App. 13/904,791, 2013 | 27 | 2013 |
Method and system for workitem synchronization LW Howes, BR Gaster, MC Houston, M Mantor, M Leather, N Rubin, ... US Patent 8,607,247, 2013 | 27 | 2013 |
Method and system for yield operation supporting thread-like behavior LW Howes, BR Gaster, MC Houston US Patent 9,697,003, 2017 | 19 | 2017 |
Method and system for synchronization of workitems with divergent control flow MC Houston, BR Gaster, LW Howes, M Mantor, D Behr US Patent 9,424,099, 2016 | 16 | 2016 |
Introduction to GPU radix sort T Harada, L Howes Heterogeneous Computing with OpenCL. Morgan Kaufman, 2011 | 15 | 2011 |
Towards metaprogramming for parallel systems on a chip L Howes, A Lokhmotov, AF Donaldson, PHJ Kelly European Conference on Parallel Processing, 36-45, 2009 | 13 | 2009 |
High-performance SIMT code generation in an active visual effects library JLT Cornwall, L Howes, PHJ Kelly, P Parsonage, B Nicoletti Proceedings of the 6th ACM conference on Computing frontiers, 175-184, 2009 | 13 | 2009 |
KMA: A dynamic memory manager for OpenCL R Spliet, L Howes, BR Gaster, AL Varbanescu Proceedings of Workshop on General Purpose Processing Using GPUs, 9-18, 2014 | 12 | 2014 |