Performance portability in reverse time migration and seismic modelling via OpenACC

A Qawasmeh, MR Hugues… - … Journal of High …, 2017 - journals.sagepub.com
Heterogeneity among the computational resources within a single machine has significantly
increased in high performance computing to exploit the tremendous potential of graphics …

Porting an explicit time-domain volume-integral-equation solver on GPUs with OpenACC [Open Problems in CEM]

S Feki, A Al-Jarro, A Clo, H Bagci - IEEE Antennas and …, 2014 - ieeexplore.ieee.org
Graphics processing units (GPUs) are gradually becoming mainstream in high-performance
computing, as their capabilities for enhancing performance of a large spectrum of scientific …

Research on matrix multiplication based on the combination of OpenACC and CUDA

Y Wang - Geo-informatics in Sustainable Ecosystem and Society …, 2019 - Springer
With the improvement of GPU's general computing capacity, the use of parallel computing to
solve some difficult problems with large amount of data and intensive computing tasks has …

Historic learning approach for auto-tuning OpenACC accelerated scientific applications

S Siddiqui, F AlZayer, S Feki - … Conference, Eugene, OR, USA, June 30 …, 2015 - Springer
The performance optimization of scientific applications usually requires an in-depth
knowledge of the hardware and software. A performance tuning mechanism is suggested to …

[BOOK][B] Parallel computation with fast algorithms for micromagnetic simulations on GPUs

S Fu - 2016 - search.proquest.com
Micromagnetics is a field of study considering the magnetization behavior in magnetic
materials and devices accounting for a wide set of interactions and describing the …

ACCTuner: OpenACC Auto-Tuner For Accelerated Scientific Applications

F Alzayer - 2015 - repository.kaust.edu.sa
We optimize parameters in OpenACC clauses for a stencil evaluation kernel executed on
Graphical Processing Units (GPUs) using a variety of machine learning and optimization …

Porting an Explicit Time-Domain Volume Integral Equation Solver onto Multiple GPUs Using MPI and OpenACC

S Feki, A Al-Jarro, H Bagci - Applied …, 2018 - journals.riverpublishers.com
A scalable parallelization algorithm to port an explicit marching-on-in-time (MOT)-based time
domain volume integral equation (TDVIE) solver onto multi-GPUs is described. The …

An Efficient Parallel Algorithm for Simpson Cumulative Integration on GPU

IWA Swardiana, T Wirahman… - 2015 Third International …, 2015 - ieeexplore.ieee.org
In this paper, we present an efficient parallel algorithm for calculating cumulative integration
based on Simpson's rule. The proposed parallel algorithm exploits two Blelloch's prefix …