PT-Scotch: A tool for efficient parallel graph ordering

C Chevalier, F Pellegrini - Parallel computing, 2008 - Elsevier
The parallel ordering of large graphs is a difficult problem, because on the one hand
minimum degree algorithms do not parallelize well, and on the other hand the obtainment of …

High-performance finite-element simulations of seismic wave propagation in three-dimensional nonlinear inelastic geological media

F Dupros, F De Martin, E Foerster, D Komatitsch… - Parallel Computing, 2010 - Elsevier
We present finite-element numerical simulations of seismic wave propagation in non linear
inelastic geological media. We demonstrate the feasibility of large-scale modeling based on …

Exploiting intensive multithreading for the efficient simulation of 3D seismic wave propagation

F Dupros, H Aochi, A Ducellier… - 2008 11th IEEE …, 2008 - ieeexplore.ieee.org
Parallel computing is widely used for large scale three-dimensional simulation of seismic
wave propagation. One particularity of most of these simulations is to consider a finite …

Sparse direct solvers with accelerators over DAG runtimes

X Lacoste, P Ramet, M Faverge, J Dongarra - 2012 - inria.hal.science
The current trend in the high performance computing shows a dramatic increase in the
number of cores on the shared memory compute nodes. Algorithms, especially those related …

A parallel tiled solver for dense symmetric indefinite systems on multicore architectures

M Baboulin, D Becker… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
We describe an efficient and innovative parallel tiled algorithm for solving symmetric
indefinite systems on multicore architectures. This solver avoids pivoting by using a …

New scheduling strategies and hybrid programming for a parallel right-looking sparse LU factorization algorithm on multicore cluster systems

I Yamazaki, XS Li - 2012 IEEE 26th International Parallel and …, 2012 - ieeexplore.ieee.org
Parallel sparse LU factorization is a key computational kernel in the solution of a large-scale
linear system of equations. In this paper, we propose two strategies to address some …

Ordonnancement hybride statique-dynamique en algèbre linéaire creuse pour de grands clusters de machines NUMA et multi-coeurs

M Faverge - 2009 - theses.hal.science
Les nouvelles architectures de calcul intensif intègrent de plus en plus de microprocesseurs
qui eux-mêmes intègrent un nombre croissant de cœurs de calcul. Cette multiplication des …

Dynamic scheduling for sparse direct solver on NUMA architectures

M Faverge, P Ramet - PARA'08, 2008 - inria.hal.science
Over the past few years, parallel sparse direct solvers made significant progress and are
now able to efficiently work on problems with several millions of equations. This paper …

A NUMA aware scheduler for a parallel sparse direct solver

M Faverge, P Ramet - Workshop on Massively Multiprocessor and …, 2009 - inria.hal.science
Over the past few years, parallel sparse direct solvers have made significant progress. They
are now able to solve efficiently real-life three-dimensional problems with several millions of …

Fast and reliable solutions for numerical linear algebra solvers in high-performance computing.

M Baboulin - 2012 - theses.hal.science
In this" Habilitation à Diriger des Recherches"(HDR), we present our research in high-
performance scientific computing over the recent years. Our work has been mainly related to …