Title | Authors | Venue | Cited by | Year |
Lift: a functional data-parallel IR for high-performance GPU code generation | M Steuwer, T Remmelg, C Dubach | 2017 IEEE/ACM International Symposium on Code Generation and Optimization …, 2017 | 223 | 2017 |
Automatic matching of legacy code to heterogeneous APIs: An idiomatic approach | P Ginsbach, T Remmelg, M Steuwer, B Bodin, C Dubach, MFP O'Boyle | Proceedings of the Twenty-Third International Conference on Architectural …, 2018 | 42 | 2018 |
Performance portable GPU code generation for matrix multiplication | T Remmelg, T Lutz, M Steuwer, C Dubach | Proceedings of the 9th Annual Workshop on General Purpose Processing using …, 2016 | 40 | 2016 |
Matrix multiplication beyond auto-tuning: rewrite-based GPU code generation | M Steuwer, T Remmelg, C Dubach | Proceedings of the International Conference on Compilers, Architectures and …, 2016 | 36 | 2016 |
Runtime code generation and data management for heterogeneous computing in Java | JJ Fumero, T Remmelg, M Steuwer, C Dubach | Proceedings of the principles and practices of programming on the Java …, 2015 | 31 | 2015 |
Introducing parallelism to the ranges TS | G Brown, C Di Bella, M Haidl, T Remmelg, R Reyes, M Steuwer | Proceedings of the International Workshop on OpenCL, 1-5, 2018 | 4 | 2018 |
High-level hardware feature extraction for GPU performance prediction of stencils | T Remmelg, B Hagedorn, L Li, M Steuwer, S Gorlatch, C Dubach | Proceedings of the 13th Annual Workshop on General Purpose Processing Using …, 2020 | 3 | 2020 |
Automatic performance optimisation of parallel programs for GPUs via rewrite rules | T Remmelg | The University of Edinburgh, 2019 | 2 | 2019 |
Compiler Optimisations for the Java Virtual Machine | T Remmelg | | | |