On-the-fly elimination of dynamic irregularities for GPU computing EZ Zhang, Y Jiang, Z Guo, K Tian, X Shen ACM SIGARCH Computer Architecture News 39 (1), 369-380, 2011 | 253 | 2011 |
A cross-input adaptive framework for GPU program optimizations Y Liu, EZ Zhang, X Shen 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-10, 2009 | 180 | 2009 |
Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? EZ Zhang, Y Jiang, X Shen ACM SIGPLAN Notices 45 (5), 203-212, 2010 | 155 | 2010 |
Streamlining GPU applications on the fly: thread divergence elimination through runtime thread-data remapping EZ Zhang, Y Jiang, Z Guo, X Shen Proceedings of the 24th ACM International Conference on Supercomputing, 115-126, 2010 | 147 | 2010 |
Complexity analysis and algorithm design for reorganizing data to minimize non-coalesced memory accesses on GPU B Wu, Z Zhao, EZ Zhang, Y Jiang, X Shen Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of …, 2013 | 127 | 2013 |
Is reuse distance applicable to data locality analysis on chip multiprocessors? Y Jiang, EZ Zhang, K Tian, X Shen International Conference on Compiler Construction, 264-282, 2010 | 122 | 2010 |
Trace data characterization and fitting for Markov modeling G Casale, EZ Zhang, E Smirni Performance Evaluation 67 (2), 61-79, 2010 | 100 | 2010 |
KPC-toolbox: Simple yet effective trace fitting using Markovian arrival processes G Casale, EZ Zhang, E Smirni 2008 Fifth International Conference on Quantitative Evaluation of Systems, 83-92, 2008 | 87 | 2008 |
Time-Optimal Qubit Mapping C Zhang, AB Hayes, L Qiu, Y Jin, Y Chen, EZ Zhang Proceedings of The 26th ACM International Conference on Architectural …, 2021 | 68 | 2021 |
New-Sum: A Novel Online ABFT Scheme For General Iterative Methods D Tao, SL Song, S Krishnamoorthy, P Wu, X Liang, EZ Zhang, ... Proceedings of the 25th ACM International Symposium on High-Performance …, 2016 | 54 | 2016 |
Exploiting statistical correlations for proactive prediction of program behaviors Y Jiang, EZ Zhang, K Tian, F Mao, M Gethers, X Shen, Y Gao Proceedings of the 8th annual IEEE/ACM international symposium on Code …, 2010 | 53 | 2010 |
An input-centric paradigm for program dynamic optimizations K Tian, Y Jiang, EZ Zhang, X Shen ACM SIGPLAN Notices 45 (10), 125-139, 2010 | 52 | 2010 |
A Simple Yet Effective Balanced Edge Partition Model for Parallel Computing L Li, R Geda, AB Hayes, Y Chen, P Chaudhari, EZ Zhang, M Szegedy Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (1 …, 2017 | 40 | 2017 |
Unified on-chip memory allocation for SIMT architecture AB Hayes, EZ Zhang Proceedings of the 28th ACM international conference on Supercomputing, 293-302, 2014 | 37 | 2014 |
Influence of program inputs on the selection of garbage collectors F Mao, EZ Zhang, X Shen Proceedings of the 2009 ACM SIGPLAN/SIGOPS international conference on …, 2009 | 37 | 2009 |
KernelGen--The Design and Implementation of a Next Generation Compiler Platform for Accelerating Numerical Models on GPUs D Mikushin, N Likhogrud, EZ Zhang, C Bergström 2014 IEEE International Parallel & Distributed Processing Symposium …, 2014 | 36 | 2014 |
CaQR: A Compiler-Assisted Approach for Qubit Reuse through Dynamic Circuit F Hua, Y Jin, Y Chen, S Vittal, K Krsulich, LS Bishop, J Lapeyre, ... Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 34* | 2023 |
Massive atomics for massive parallelism on GPUs IJ Egielski, J Huang, EZ Zhang Proceedings of the 2014 international symposium on Memory management, 93-103, 2014 | 34 | 2014 |
Critical points based register-concurrency autotuning for GPUs A Li, SL Song, A Kumar, EZ Zhang, D Chavarría-Miranda, H Corporaal Proceedings of the 2016 Conference on Design, Automation & Test in Europe …, 2016 | 29 | 2016 |
Correctly treating synchronizations in compiling fine-grained SPMD-threaded programs for CPU Z Guo, EZ Zhang, X Shen 2011 International Conference on Parallel Architectures and Compilation …, 2011 | 27 | 2011 |