[HTML][HTML] Efficient OpenCL system integration of non-blocking FPGA accelerators

T Leppänen, A Lotvonen, P Mousouliotis… - Microprocessors and …, 2023 - Elsevier
OpenCL functions as a portability layer for diverse heterogeneous hardware platforms
including CPUs, GPUs, FPGAs, and hardware accelerators. However, OpenCL programs …

FLIA: Architecture of Collaborated Mobile GPU and FPGA Heterogeneous Computing

N Hu, C Wang, X Zhou - Electronics, 2022 - mdpi.com
Accelerators, such as GPUs (Graphics Processing Unit) that is suitable for handling highly
parallel data, and FPGA (Field Programmable Gate Array) with algorithms customized …

Cross-vendor programming abstraction for diverse heterogeneous platforms

T Leppänen, A Lotvonen… - Frontiers in Computer …, 2022 - frontiersin.org
Hardware specialization is a well-known means to significantly improve the performance
and energy efficiency of various application domains. Modern computing systems consist of …

Mashing load balancing algorithm to boost hybrid kernels in molecular dynamics simulations

R Nozal, JL Bosque - The Journal of Supercomputing, 2023 - Springer
The path to the efficient exploitation of molecular dynamics simulators is strongly driven by
the increasingly intensive use of accelerators. However, they suffer performance portability …

PoCL-R: A scalable low latency distributed OpenCL runtime

J Solanti, M Babej, J Ikkala… - … on Embedded Computer …, 2021 - Springer
Offloading the most demanding parts of applications to an edge GPU server cluster to save
power or improve the result quality is a solution that becomes increasingly realistic with new …

Coopcl: cooperative execution of opencl programs on heterogeneous cpu-gpu platforms

K Moreń, D Göhringer - 2020 28th Euromicro International …, 2020 - ieeexplore.ieee.org
In this work, we present CoopCL, an C++ API and runtime that abstracts and unifies the
cooperative workload execution on multi-core CPU and GPU. The CoopCL takes a OpenCL …

Parallelizing irregular computations for molecular docking

L Solis-Vasquez, D Santos-Martins… - 2020 IEEE/ACM 10th …, 2020 - ieeexplore.ieee.org
AUTODOCK is a molecular docking software widely used in computational drug design. Its
time-consuming executions have motivated the development of AUTODOCK-GPU, an …

[HTML][HTML] OpenCL-like offloading with metaprogramming for SX-Aurora TSUBASA

H Takizawa, S Shiotsuki, N Ebata, R Egawa - Parallel Computing, 2021 - Elsevier
This paper presents an OpenCL-like offload programming framework for NEC SX-Aurora
TSUBASA (SX-Aurora) and also discusses the benefit of employing metaprogramming to …

PoCL-R: An Open Standard Based Offloading Layer for Heterogeneous Multi-Access Edge Computing with Server Side Scalability

J Solanti, M Babej, J Ikkala, P Jääskeläinen - arXiv preprint arXiv …, 2023 - arxiv.org
We propose a novel computing runtime that exposes remote compute devices via the cross-
vendor open heterogeneous computing standard OpenCL and can execute compute tasks …

PySchedCL: Leveraging Concurrency in Heterogeneous Data-Parallel Systems

A Ghose, S Singh, V Kulaharia… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
In the past decade, high performance compute capabilities exhibited by heterogeneous
GPGPU platforms have led to the popularity of data parallel programming languages such …