Barra: A parallel functional simulator for gpgpu C Collange, M Daumas, D Defour, D Parello 2010 IEEE International Symposium on Modeling, Analysis and Simulation of …, 2010 | 184* | 2010 |
Power consumption of GPUs from a software perspective S Collange, D Defour, A Tisserand Computational Science–ICCS 2009: 9th International Conference Baton Rouge …, 2009 | 175 | 2009 |
Dynamic detection of uniform and affine vectors in GPGPU computations S Collange, D Defour, Y Zhang Euro-Par 2009–Parallel Processing Workshops: HPPC, HeteroPar, PROPER, ROIA …, 2010 | 89 | 2010 |
Numerical reproducibility for the parallel reduction on multi-and many-core architectures C Collange, D Defour, S Graillat, R Iakymchuk Parallel Computing 49, 83-97, 2015 | 74 | 2015 |
CR-LIBM A library of correctly rounded elementary functions in double-precision C Daramy-Loirat, D Defour, F De Dinechin, M Gallet, N Gast, C Lauter, ... LIP,, 2006 | 62 | 2006 |
A new range-reduction algorithm N Brisebarre, D Defour, P Kornerup, JM Muller, N Revol IEEE Transactions on Computers 54 (3), 331-339, 2005 | 47 | 2005 |
A fast chaos-based pseudo-random bit generator using binary64 floating-point arithmetic M François, D Defour, C Negre Informatica 38 (3), 115-124, 2014 | 46 | 2014 |
Proposal for a standardization of mathematical function implementation in floating-point arithmetic D Defour, G Hanrot, V Lefevre, JM Muller, N Revol, P Zimmermann Numerical algorithms 37, 367-375, 2004 | 44 | 2004 |
ExBLAS: Reproducible and accurate BLAS library R Iakymchuk, C Collange, D Defour, S Graillat NRE: Numerical Reproducibility at Exascale, 2015 | 43 | 2015 |
CR-LIBM: A correctly rounded elementary function library C Daramy, D Defour, F De Dinechin, JM Muller Advanced Signal Processing Algorithms, Architectures, and Implementations …, 2003 | 42 | 2003 |
Implementation of float-float operators on graphics hardware G Da Gracca, D Defour arXiv preprint cs/0603115, 2006 | 39 | 2006 |
Fonctions élémentaires: algorithmes et implémentations efficaces pour l'arrondi correct en double précision D Defour Ecole normale supérieure de lyon-ENS LYON, 2003 | 37 | 2003 |
Automatic exploration of reduced floating-point representations in iterative methods Y Chatelain, E Petit, P de Oliveira Castro, G Lartigue, D Defour Euro-Par 2019: Parallel Processing: 25th International Conference on …, 2019 | 34 | 2019 |
Full-speed deterministic bit-accurate parallel floating-point summation on multi-and many-core architectures S Collange, D Defour, S Graillat, R Iakymchuk INRIA, DALI–LIRMM, LIP6, ICS, Tech. Rep. HAL: hal-00949355, 2014 | 34 | 2014 |
Software carry-save for fast multiple-precision algorithms D Defour, F De Dinechin Mathematical Software, 29-39, 2002 | 34 | 2002 |
Using graphics processors for parallelizing hash-based data carving C Collange, YS Dandass, M Daumas, D Defour 2009 42nd Hawaii International Conference on System Sciences, 1-10, 2009 | 33 | 2009 |
Fast correct rounding of elementary functions in double precision using double-extended arithmetic F De Dinechin, D Defour, C Lauter INRIA, LIP, 2004 | 29 | 2004 |
Correctly rounded exponential function in double-precision arithmetic D Defour, F De Dinechin, JM Muller Advanced Signal Processing Algorithms, Architectures, and Implementations XI …, 2001 | 27 | 2001 |
A new scheme for table-based evaluation of functions D Defour, F de Dinechin, JM Muller Conference Record of the Thirty-Sixth Asilomar Conference on Signals …, 2002 | 26 | 2002 |
Cache-optimised methods for the evaluation of elementary functions D Defour Laboratoire de l'informatique du parallélisme, 2002 | 21 | 2002 |