Impact of resource sharing on performance and performance prediction: A survey A Abel, F Benz, J Doerfert, B Dörr, S Hahn, F Haupenthal, M Jacobs, ... CONCUR 2013–Concurrency Theory: 24th International Conference, CONCUR 2013 …, 2013 | 81 | 2013 |
Reverse-mode automatic differentiation and optimization of GPU kernels via Enzyme WS Moses, V Churavy, L Paehler, J Hückelheim, SHK Narayanan, ... Proceedings of the international conference for high performance computing …, 2021 | 52 | 2021 |
Runtime pointer disambiguation P Alves, F Gruber, J Doerfert, A Lamprineas, T Grosser, F Rastello, ... Proceedings of the 2015 ACM SIGPLAN International Conference on Object …, 2015 | 40 | 2015 |
Polly's Polyhedral Scheduling in the Presence of Reductions J Doerfert, K Streit, S Hack, Z Benaissa In 5th International Workshop on Polyhedral Compilation Techniques (IMPACT …, 2015 | 39 | 2015 |
OpenMP application experiences: Porting to accelerated nodes S Bak, C Bertoni, S Boehm, R Budiardja, BM Chapman, J Doerfert, ... Parallel Computing 109, 102856, 2022 | 33 | 2022 |
The TRegion Interface and Compiler Optimizations for OpenMP Target Regions J Doerfert, JMM Diaz, H Finkel OpenMP: Conquering the Full Hardware Spectrum: 15th International Workshop …, 2019 | 32 | 2019 |
Efficient Execution of OpenMP on GPUs J Huber, M Cornelius, G Georgakoudis, S Tian, JM Monsalve Diaz, ... 2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO), 2022 | 29 | 2022 |
Experience Report: Writing a Portable GPU Runtime with OpenMP 5.1 S Tian, J Chesterfield, J Doerfert, B Chapman OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop …, 2021 | 29 | 2021 |
Concurrent execution of deferred OpenMP target tasks with hidden helper threads S Tian, J Doerfert, B Chapman International Workshop on Languages and Compilers for Parallel Computing, 41-56, 2020 | 29 | 2020 |
Optimistic loop optimization J Doerfert, T Grosser, S Hack 2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017 | 25 | 2017 |
Generalized task parallelism K Streit, J Doerfert, C Hammacher, A Zeller, S Hack ACM Transactions on Architecture and Code Optimization (TACO) 12 (1), 1-25, 2015 | 22* | 2015 |
Toward Portable GPU Acceleration of the OpenMC Monte Carlo Particle Transport Code JR Tramm, PK Romano, J Doerfert, AL Lund, PC Shriwise, AR Siegel, ... International Conference on Physics of Reactors (PHYSOR), 2022 | 21 | 2022 |
Spolly: speculative optimizations in the polyhedral model J Doerfert, C Hammacher, K Streit, S Hack IMPACT 2013 55, 2013 | 21 | 2013 |
Compiler Optimizations for OpenMP J Doerfert, H Finkel International Workshop on OpenMP, 113-127, 2018 | 20 | 2018 |
Breaking the vendor lock: Performance portable programming through openmp as target independent runtime layer J Doerfert, M Jasper, J Huber, K Abdelaal, G Georgakoudis, T Scogland, ... Proceedings of the International Conference on Parallel Architectures and …, 2022 | 18 | 2022 |
Input Space Splitting for OpenCL S Moll, J Doerfert, S Hack Proceedings of the 25th International Conference on Compiler Construction …, 2016 | 18 | 2016 |
Remote OpenMP Offloading A Patel, J Doerfert ISC High-Performance, 2022 | 17 | 2022 |
Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution J Doerfert, A Patel, J Huber, S Tian, JM Monsalve Diaz, B Chapman, ... 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022 | 17 | 2022 |
A virtual GPU as developer-friendly OpenMP offload target A Patel, S Tian, J Doerfert, B Chapman 50th International Conference on Parallel Processing Workshop, 1-7, 2021 | 16 | 2021 |
Architecture-parametric timing analysis J Reineke, J Doerfert 2014 IEEE 19th Real-Time and Embedded Technology and Applications Symposium …, 2014 | 13 | 2014 |