SpMV and BiCG-Stab optimization for a class of hepta-diagonal-sparse matrices on GPU

MA Al-Mouhamed, AH Khan - The Journal of Supercomputing, 2017 - Springer
The abundant data parallelism available in many-core GPUs has been a key interest to
improve accuracy in scientific and engineering simulation. In many cases, most of the …

Block red–black MILU (0) preconditioner with relaxation on GPU

A Shioya, Y Yamamoto - Parallel Computing, 2021 - Elsevier
To accelerate the Krylov subspace-based linear equation solvers on Graphics Processing
Units (GPUs), a stable, efficient and highly parallel preconditioner is essential. One of the …

[PDF][PDF] An in-depth evaluation of GCC's OpenACC implementation on Cray systems

VGV Larrea, WR Elwasif, O Hernandez, C Philippidis… - 2017 - cug.org
OpenACC is a directive-based API that extends the C/C++ and Fortran base languages to
program accelerators and multicores. Several commercial implementations are available …

Experimental evaluation and enhancement of optimizations of annotation-based and automatic parallel code generators for GPUs

AA Almousa - 2017 - search.proquest.com
GPUs have gained a lot of attention in the HPC community lately. Since that, a lot of
research was done on creating and optimization language models that enable programming …

Comparative Analysis of OpenACC Compilers

D Barba, A Gonzalez-Escribano, DR Llanos - International Conference on …, 2016 - Springer
OpenACC has been on development for a few years now. The OpenACC 2.5 specification
was recently made public and there are some initiatives for developing full implementations …

[PDF][PDF] An in-depth evaluation of GCC's OpenACC implementation in Cray systems

OpenACC is a directive-based API that extends the C/C++ and Fortran base languages to
program accelerators and multicores. Several commercial implementations are available …

Herramientas para la Evaluacion de Compiladores para OpenACC

D Barba Gutiérrez - 2016 - uvadoc.uva.es
Este Trabajo de Fin de Grado presenta TORMENT OpenACC2016, una herramienta de
benchmarking para OpenACC, un nuevo modelo de programación paralela para …

[引用][C] Comparando o Tempo de Execuçao e Consumo de Energia de Aplicaçoes Compiladas com GCC e PGI

LC de Lima, L Pereira, F Rossi, MC Luizelli… - Anais da XX Escola …, 2020 - SBC

[引用][C] A framework for managing shared accelerators in heterogeneous environments

EM O'Neill - 2015 - Queen's University Belfast