The pochoir stencil compiler Y Tang, RA Chowdhury, BC Kuszmaul, CK Luk, CE Leiserson Proceedings of the twenty-third annual ACM symposium on Parallelism in …, 2011 | 453 | 2011 |
Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency Y Tang, R You, H Kan, JJ Tithi, P Ganapathi, RA Chowdhury Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of …, 2015 | 42 | 2015 |
VNET/P: Bridging the cloud and high performance computing through fast overlay networking L Xia, Z Cui, JR Lange, Y Tang, PA Dinda, PG Bridges Proceedings of the 21st international symposium on High-Performance Parallel …, 2012 | 35 | 2012 |
Autogen: Automatic discovery of cache-oblivious parallel recursive algorithms for solving dynamic programs R Chowdhury, P Ganapathi, JJ Tithi, C Bachmeier, BC Kuszmaul, ... ACM SIGPLAN Notices 51 (8), 1-12, 2016 | 34 | 2016 |
Extending the nested parallel model to the nested dataflow model with provably efficient schedulers D Dinh, HV Simhadri, Y Tang Proceedings of the 28th ACM Symposium on Parallelism in Algorithms and …, 2016 | 24 | 2016 |
Coding Stencil Computations Using the Pochoir {Stencil-Specification} Language Y Tang, R Chowdhury, CK Luk, CE Leiserson | 24 | 2011 |
Provably efficient scheduling of cache-oblivious wavefront algorithms R Chowdhury, P Ganapathi, Y Tang, JJ Tithi Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and …, 2017 | 23 | 2017 |
Autogen: Automatic discovery of efficient recursive divide-8-conquer algorithms for solving dynamic programming problems R Chowdhury, P Ganapathi, S Tschudi, JJ Tithi, C Bachmeier, ... ACM Transactions on Parallel Computing (TOPC) 4 (1), 1-30, 2017 | 17 | 2017 |
Improving parallelism of recursive stencil computations without sacrificing cache performance Y Tang, R You, H Kan, JJ Tithi, P Ganapathi, RA Chowdhury Proceedings of the Second Workshop on Optimizing Stencil Computations, 1-7, 2014 | 11 | 2014 |
Fast VMM-based overlay networking for bridging the cloud and high performance computing L Xia, Z Cui, J Lange, Y Tang, P Dinda, P Bridges Cluster computing 17, 39-59, 2014 | 9 | 2014 |
Weight balancing on boundaries and skeletons L Barba, O Cheong, JL De Carufel, MG Dobbins, R Fleischer, ... Proceedings of the thirtieth annual symposium on Computational geometry, 436-443, 2014 | 7 | 2014 |
Block Size Selection of Parallel LU and QR on PVP-based and RISC-based Supercomputers Y Zhang, Y Chen, Y Tang Proceedings of the 2007 Asian technology information program's (ATIP's) 3rd …, 2007 | 6 | 2007 |
Brief announcement: Star (space-time adaptive and reductive) algorithms for dynamic programming recurrences with more than O (1) dependency Y Tang, S Wang Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and …, 2017 | 5 | 2017 |
VON/KVM: A high performance virtual overlay network integrated with KVM Y Tang, JP Li The 2010 International Conference on Apperceiving Computing and Intelligence …, 2010 | 5 | 2010 |
Balanced partitioning of several cache-oblivious algorithms Y Tang Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and …, 2020 | 4 | 2020 |
Proposal of MPI operation level checkpoint/rollback and one implementation Y Tang, GE Fagg, JJ Dongarra Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID …, 2006 | 4 | 2006 |
New consideration on the evaluation model of cluster area network 唐渊, 孙家昶, 张云泉, 张林波 Journal of Software 16 (6), 1131-1139, 2005 | 3 | 2005 |
CAMPSNA: A cloud assisted mobile peer to peer social network architecture YLH Tang, G Zhao Journal of Digital Information Management 12 (2), 127, 2014 | 2 | 2014 |
Intelligent Fault Diagnosis System in Large Industrial Networks YY Huang, JP Li, FL Xu, Y Tang, J Lin 2008 International Conference on Apperceiving Computing and Intelligence …, 2008 | 2 | 2008 |
Processor-Aware Cache-Oblivious Algorithms✱ Y Tang, W Gao Proceedings of the 50th International Conference on Parallel Processing, 1-10, 2021 | 1 | 2021 |