Performance portability in reverse time migration and seismic modelling via OpenACC
A Qawasmeh, MR Hugues… - … Journal of High …, 2017 - journals.sagepub.com
Heterogeneity among the computational resources within a single machine has significantly
increased in high performance computing to exploit the tremendous potential of graphics …
increased in high performance computing to exploit the tremendous potential of graphics …
Porting an explicit time-domain volume-integral-equation solver on GPUs with OpenACC [open problems in cem]
Graphics processing units (GPUs) are gradually becoming mainstream in high-performance
computing, as their capabilities for enhancing performance of a large spectrum of scientific …
computing, as their capabilities for enhancing performance of a large spectrum of scientific …
Research on matrix multiplication based on the combination of openacc and cuda
Y Wang - Geo-informatics in Sustainable Ecosystem and Society …, 2019 - Springer
With the improvement of GPU's general computing capacity, the use of parallel computing to
solve some difficult problems with large amount of data and intensive computing tasks has …
solve some difficult problems with large amount of data and intensive computing tasks has …
Historic learning approach for auto-tuning OpenACC accelerated scientific applications
The performance optimization of scientific applications usually requires an in-depth
knowledge of the hardware and software. A performance tuning mechanism is suggested to …
knowledge of the hardware and software. A performance tuning mechanism is suggested to …
[图书][B] Parallel computation with fast algorithms for micromagnetic simulations on GPUs
S Fu - 2016 - search.proquest.com
Micromagnetics is a field of study considering the magnetization behavior in magnetic
materials and devices accounting for a wide set of interactions and describing the …
materials and devices accounting for a wide set of interactions and describing the …
ACCTuner: OpenACC Auto-Tuner For Accelerated Scientific Applications
F Alzayer - 2015 - repository.kaust.edu.sa
We optimize parameters in OpenACC clauses for a stencil evaluation kernel executed on
Graphical Processing Units (GPUs) using a variety of machine learning and optimization …
Graphical Processing Units (GPUs) using a variety of machine learning and optimization …
[PDF][PDF] Porting an Explicit Time-Domain Volume Integral Equation Solver onto Multiple GPUs Using MPI and OpenACC
A scalable parallelization algorithm to port an explicit marching-on-in-time (MOT)-based time
domain volume integral equation (TDVIE) solver onto multi-GPUs is described. The …
domain volume integral equation (TDVIE) solver onto multi-GPUs is described. The …
An Efficient Parallel Algorithm for Simpson Cumulative Integration on GPU
IWA Swardiana, T Wirahman… - 2015 Third International …, 2015 - ieeexplore.ieee.org
In this paper, we present an efficient parallel algorithm for calculating cumulative integration
based on Simpson's rule. The proposed parallel algorithm exploits two Blelloch's prefix …
based on Simpson's rule. The proposed parallel algorithm exploits two Blelloch's prefix …