RAJA: Portable Performance for Large-Scale Scientific Applications DA Beckingsale, J Burmark, R Hornung, H Jones, W Killian, AJ Kunen, ... Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States), 2019 | 234 | 2019 |
Scalable I/O-aware job scheduling for burst buffer enabled HPC clusters S Herbein, DH Ahn, D Lipari, TRW Scogland, M Stearman, M Grondona, ... Proceedings of the 25th ACM International Symposium on High-Performance …, 2016 | 95 | 2016 |
Heterogeneous task scheduling for accelerated openmp TRW Scogland, B Rountree, W Feng, BR De Supinski 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 90 | 2012 |
Opencl and the 13 dwarfs: A work in progress W Feng, H Lin, T Scogland, J Zhang Proceedings of the 3rd acm/spec international conference on performance …, 2012 | 84 | 2012 |
Trends in energy-efficient computing: A perspective from the Green500 B Subramaniam, W Saunders, T Scogland, W Feng 2013 International Green Computing Conference Proceedings, 1-8, 2013 | 80 | 2013 |
Flux: Overcoming scheduling challenges for exascale workflows DH Ahn, N Bass, A Chu, J Garlick, M Grondona, S Herbein, HI Ingólfsson, ... Future Generation Computer Systems 110, 202-213, 2020 | 75 | 2020 |
The ongoing evolution of openmp BR de Supinski, TRW Scogland, A Duran, M Klemm, SM Bellido, ... Proceedings of the IEEE 106 (11), 2004-2019, 2018 | 68 | 2018 |
A massively parallel infrastructure for adaptive multiscale simulations: modeling RAS initiation pathway for cancer F Di Natale, H Bhatia, TS Carpenter, C Neale, S Kokkila-Schumacher, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 67 | 2019 |
StreamMR: An Optimized MapReduce Framework for AMD GPUs M Elteir, H Lin, W Feng, T Scogland Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International …, 2011 | 64 | 2011 |
The green500 list: Year one WC Feng, T Scogland 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-7, 2009 | 61 | 2009 |
Performance portable C++ programming with RAJA (tutorial) D Beckingsale, R Hornung, T Scogland, A Vargas Proceedings of the 24th Symposium on Principles and Practice of Parallel …, 2019 | 41 | 2019 |
A Power-Measurement Methodology for Large-Scale, High-Performance Computing TRW Scogland, CP Steffen, T Wilde, F Parent, S Coghlan, N Bates, ... | 41 | 2014 |
Architecture-Aware Mapping and Optimization on a 1600-Core GPU M Daga, T Scogland, W Feng Parallel and Distributed Systems (ICPADS), 2011 IEEE 17th International …, 2011 | 41 | 2011 |
Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation T Scogland, P Balaji, W Feng, G Narayanaswamy SC'08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, 1-12, 2008 | 38 | 2008 |
Accelerating electrostatic surface potential calculation with multi-scale approximation on graphics processing units R Anandakrishnan, TRW Scogland, AT Fenley, JC Gordon, W Feng, ... Journal of Molecular Graphics and Modelling 28 (8), 904-910, 2010 | 37 | 2010 |
OpenMP application experiences: Porting to accelerated nodes S Bak, C Bertoni, S Boehm, R Budiardja, BM Chapman, J Doerfert, ... Parallel Computing 109, 102856, 2022 | 33 | 2022 |
Directive-based GPU programming for computational fluid dynamics BP Pickering, CW Jackson, TRW Scogland, WC Feng, CJ Roy Computers & Fluids 114, 242-253, 2015 | 32 | 2015 |
A first look at integrated GPUs for green high-performance computing TRW Scogland, H Lin, W Feng Computer Science-Research and Development 25 (3), 125-134, 2010 | 30 | 2010 |
Design and evaluation of scalable concurrent queues for many-core architectures TRW Scogland, W Feng Proceedings of the 6th ACM/SPEC International Conference on Performance …, 2015 | 28 | 2015 |
CoreTSAR: adaptive worksharing for heterogeneous systems TRW Scogland, W Feng, B Rountree, BR de Supinski International Supercomputing Conference, 172-186, 2014 | 27 | 2014 |