Towards dense linear algebra for hybrid GPU accelerated manycore systems S Tomov, J Dongarra, M Baboulin Parallel Computing 36 (5-6), 232-240, 2010 | 591 | 2010 |
Accelerating scientific computations with mixed precision algorithms M Baboulin, A Buttari, J Dongarra, J Kurzak, J Langou, J Langou, ... Computer Physics Communications 180 (12), 2526-2533, 2009 | 274 | 2009 |
High-performance tensor contractions for GPUs A Abdelfattah, M Baboulin, V Dobrev, J Dongarra, C Earl, J Falcou, ... Procedia Computer Science 80, 108-118, 2016 | 73 | 2016 |
High-performance matrix-matrix multiplications of very small matrices I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ... Euro-Par 2016: Parallel Processing: 22nd International Conference on …, 2016 | 68 | 2016 |
Some issues in dense linear algebra for multicore and special purpose architectures M Baboulin, J Dongarra, S Tomov Centro de Matemática da Universidade de Coimbra, 2008 | 67 | 2008 |
Accelerating linear system solutions using randomization techniques M Baboulin, J Dongarra, J Herrmann, S Tomov ACM Transactions on Mathematical Software (TOMS) 39 (2), 1-13, 2013 | 55 | 2013 |
A contribution to the conditioning of the total least-squares problem M Baboulin, S Gratton SIAM Journal on Matrix Analysis and Applications 32 (3), 685-699, 2011 | 47 | 2011 |
Collective Mind: Towards Practical and Collaborative Auto‐Tuning G Fursin, R Miceli, A Lokhmotov, M Gerndt, M Baboulin, AD Malony, ... Scientific Programming 22 (4), 309-329, 2014 | 45 | 2014 |
A partial condition number for linear least squares problems M Arioli, M Baboulin, S Gratton SIAM Journal on Matrix Analysis and Applications 29 (2), 413-433, 2007 | 45 | 2007 |
A class of communication-avoiding algorithms for solving general dense linear systems on CPU/GPU parallel machines M Baboulin, S Donfack, J Dongarra, L Grigori, A Rémy, S Tomov Procedia Computer Science 9, 17-26, 2012 | 39 | 2012 |
A parallel solver for incompressible fluid flows Y Wang, M Baboulin, J Dongarra, J Falcou, Y Fraigneau, O Le Maître Procedia Computer Science 18, 439-448, 2013 | 31 | 2013 |
Reducing the amount of pivoting in symmetric indefinite systems D Becker, M Baboulin, J Dongarra Parallel Processing and Applied Mathematics: 9th International Conference …, 2012 | 31 | 2012 |
Using random butterfly transformations to avoid pivoting in sparse direct methods M Baboulin, XS Li, FH Rouet High Performance Computing for Computational Science--VECPAR 2014: 11th …, 2015 | 30 | 2015 |
Parallel tools for solving incremental dense least squares problems: application to space geodesy A Baboulin, L Giraud, S Gratton, J Langou Journal of Algorithms & Computational Technology 3 (1), 117-133, 2009 | 29 | 2009 |
A parallel tiled solver for dense symmetric indefinite systems on multicore architectures M Baboulin, D Becker, J Dongarra 2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012 | 28 | 2012 |
Quantum CNOT circuits synthesis for NISQ architectures using the syndrome decoding problem TG de Brugière, M Baboulin, B Valiron, S Martiel, C Allouche Reversible Computation: 12th International Conference, RC 2020, Oslo, Norway …, 2020 | 27 | 2020 |
A parallel distributed solver for large dense symmetric systems: applications to geodesy and electromagnetism problems M Baboulin, L Giraud, S Gratton The International Journal of High Performance Computing Applications 19 (4 …, 2005 | 27 | 2005 |
Computing the conditioning of the components of a linear least‐squares solution M Baboulin, J Dongarra, S Gratton, J Langou Numerical Linear Algebra with Applications 16 (7), 517-533, 2009 | 26 | 2009 |
Reducing the depth of linear reversible quantum circuits TG De Brugiere, M Baboulin, B Valiron, S Martiel, C Allouche IEEE Transactions on Quantum Engineering 2, 1-22, 2021 | 24 | 2021 |
Algorithms and optimization techniques for high-performance matrix-matrix multiplications of very small matrices I Masliah, A Abdelfattah, A Haidar, S Tomov, M Baboulin, J Falcou, ... Parallel Computing 81, 1-21, 2019 | 24 | 2019 |