The LINPACK benchmark: past, present and future JJ Dongarra, P Luszczek, A Petitet Concurrency and Computation: practice and experience 15 (9), 803-820, 2003 | 1263 | 2003 |
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects E Agullo, J Demmel, J Dongarra, B Hadri, J Kurzak, J Langou, H Ltaief, ... Journal of Physics: Conference Series 180 (1), 012037, 2009 | 581 | 2009 |
A new metric for ranking high-performance computing systems J Dongarra, MA Heroux, P Luszczek National Science Review 3 (1), 30-35, 2016 | 483* | 2016 |
From CUDA to OpenCL: Towards a performance-portable solution for multi-platform GPU programming P Du, R Weber, P Luszczek, S Tomov, G Peterson, J Dongarra Parallel Computing 38 (8), 391-407, 2012 | 475 | 2012 |
The HPC Challenge (HPCC) benchmark suite PR Luszczek, DH Bailey, JJ Dongarra, J Kepner, RF Lucas, ... Proceedings of the 2006 ACM/IEEE conference on Supercomputing 213 (10.1145), 1, 2006 | 362 | 2006 |
Introduction to the HPC challenge benchmark suite P Luszczek, JJ Dongarra, D Koester, R Rabenseifner, B Lucas, J Kepner, ... | 273 | 2005 |
Accelerating scientific computations with mixed precision algorithms M Baboulin, A Buttari, J Dongarra, J Kurzak, J Langou, J Langou, ... Computer Physics Communications 180 (12), 2526-2533, 2009 | 272 | 2009 |
Measuring energy and power with PAPI VM Weaver, M Johnson, K Kasichayanula, J Ralph, P Luszczek, ... 2012 41st international conference on parallel processing workshops, 262-268, 2012 | 266 | 2012 |
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA G Bosilca, A Bouteiller, A Danalis, M Faverge, A Haidar, T Herault, ... 2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011 | 250* | 2011 |
Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems) J Langou, J Langou, P Luszczek, J Kurzak, A Buttari, J Dongarra Proceedings of the 2006 ACM/IEEE Conference on Supercomputing, 113-es, 2006 | 204 | 2006 |
Mixed precision iterative refinement techniques for the solution of dense linear systems A Buttari, J Dongarra, J Langou, J Langou, P Luszczek, J Kurzak The International Journal of High Performance Computing Applications 21 (4 …, 2007 | 183 | 2007 |
The impact of multicore on math software A Buttari, J Dongarra, J Kurzak, J Langou, P Luszczek, S Tomov International Workshop on Applied Parallel Computing, 1-10, 2006 | 178 | 2006 |
Introduction to the HPC Challenge benchmark suite JJ Dongarra, D Takahashi, D Bailey, D Koester, P Luszczek, ... Lawrence Berkeley National Laboratory 10 (1188455.1188677), 2005 | 157 | 2005 |
Hpcg benchmark: a new metric for ranking high performance computing systems J Dongarra, MA Heroux, P Luszczek Knoxville, Tennessee 42, 2015 | 156 | 2015 |
Using mixed precision for sparse matrix computations to enhance the performance while achieving 64-bit accuracy A Buttari, J Dongarra, J Kurzak, P Luszczek, S Tomov ACM Transactions on Mathematical Software (TOMS) 34 (4), 1-22, 2008 | 151 | 2008 |
Accelerating numerical dense linear algebra calculations with GPUs J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ... Numerical computations with GPUs, 3-28, 2014 | 133 | 2014 |
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, A Fox, ... The International Journal of High Performance Computing Applications 35 (4 …, 2021 | 119 | 2021 |
Power aware computing on GPUs K Kasichayanula, D Terpstra, P Luszczek, S Tomov, S Moore, ... 2012 Symposium on Application Accelerators in High Performance Computing, 64-73, 2012 | 113 | 2012 |
LINPACK Benchmark. JJ Dongarra, P Luszczek Encyclopedia of Parallel Computing, 1033-1036, 2011 | 94 | 2011 |
A rough guide to scientific computing on the playstation 3 A Buttari, P Luszczek, J Kurzak, J Dongarra, G Bosilca version 1.0. Technical Report UT-CS-07-595, Computer Science Department …, 2007 | 93 | 2007 |