A communication-optimal framework for contracting distributed tensors S Rajbhandari, A Nikam, PW Lai, K Stock, S Krishnamoorthy, ... SC'14: Proceedings of the International Conference for High Performance …, 2014 | 35 | 2014 |
Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs PW Lai, H Arafat, V Elango, P Sadayappan 20th Annual international conference on high performance computing, 139-148, 2013 | 31 | 2013 |
A framework for load balancing of tensor contraction expressions via dynamic task partitioning PW Lai, K Stock, S Rajbhandari, S Krishnamoorthy, P Sadayappan Proceedings of the International Conference on High Performance Computing …, 2013 | 30 | 2013 |
A fast implementation of MLR-MCL algorithm on multi-core processors Q Niu, PW Lai, SM Faisal, S Parthasarathy, P Sadayappan 2014 21st International Conference on High Performance Computing (HiPC), 1-10, 2014 | 18 | 2014 |
Framework for distributed contractions of tensors with symmetry S Rajbhandari, A Nikam, PW Lai, K Stock, S Krishnamoorthy, ... Preprint, Ohio State University, 2013 | 14 | 2013 |
Effective utilization of tensor symmetry in operation optimization of tensor contraction expressions PW Lai, H Zhang, S Rajbhandari, E Valeev, K Kowalski, P Sadayappan Procedia Computer Science 9, 412-421, 2012 | 11 | 2012 |
CAST: Contraction algorithm for symmetric tensors S Rajbhandari, A Nikam, PW Lai, K Stock, S Krishnamoorthy, ... 2014 43rd International Conference on Parallel Processing, 261-272, 2014 | 6 | 2014 |
A Framework for Performance Optimization of Tensor Contraction Expressions PW Lai The Ohio State University, 2014 | 1 | 2014 |
Optimization and performance-portable transformation of high level specifications of tensor contraction expressions P Sadayappan, S Krishnamoorthy, PW Lai, LN Pouchet, K Stock, H Zhang ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY 242, 2011 | | 2011 |