Simplified high level parallelism expression on heterogeneous systems through data partition pattern description

S Wu, X Dong, H Chen, L Wang, Q Wang… - The Computer …, 2023 - academic.oup.com
With the development of heterogeneous systems, the demand for high-level programming
methods that ease heterogeneous programming and produce portable applications has …

An autotuning protocol to rapidly build autotuners

J Liu, G Tan, Y Luo, J Li, Z Mo, N Sun - ACM Transactions on Parallel …, 2019 - dl.acm.org
Automatic performance tuning (Autotuning) is an increasingly critical tuning technique for the
high portable performance of Exascale applications. However, constructing an autotuner …

Toward performance portability for CPUs and GPUs through algorithmic compositions

LW Chang - 2017 - ideals.illinois.edu
The diversity of microarchitecture designs in heterogeneous computing systems allows
programs to achieve high performance and energy efficiency, but results in substantial …