The deep learning compiler: A comprehensive survey M Li, Y Liu, X Liu, Q Sun, X You, H Yang, Z Luan, L Gan, G Yang, D Qian IEEE Transactions on Parallel and Distributed Systems 32 (3), 708-727, 2020 | 196 | 2020 |
swcaffe: A parallel framework for accelerating deep learning applications on sunway taihulight L Li, J Fang, H Fu, J Jiang, W Zhao, C He, X You, G Yang 2018 IEEE International Conference on Cluster Computing (CLUSTER), 413-422, 2018 | 34 | 2018 |
Performance evaluation and analysis of linear algebra kernels in the prototype tianhe-3 cluster X You, H Yang, Z Luan, Y Liu, D Qian Asian Conference on Supercomputing Frontiers, 86-105, 2019 | 21 | 2019 |
Automatic code generation and optimization of large-scale stencil computation on many-core processors M Li, Y Liu, H Yang, Y Hu, Q Sun, B Chen, X You, X Liu, Z Luan, D Qian Proceedings of the 50th International Conference on Parallel Processing, 1-12, 2021 | 15 | 2021 |
ZeroSpy: exploring software inefficiency with redundant zeros X You, H Yang, Z Luan, D Qian, X Liu SC20: International Conference for High Performance Computing, Networking …, 2020 | 10 | 2020 |
DRStencil: Exploiting data reuse within low-order stencil on GPU X You, H Yang, Z Jiang, Z Luan, D Qian 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th …, 2021 | 6 | 2021 |
Towards GPU acceleration of phonon computation with ShengBTE Y Wei, X You, H Yang, Z Luan, D Qian Proceedings of the International Conference on High Performance Computing in …, 2020 | 4 | 2020 |
Vclinic: A portable and efficient framework for fine-grained value profilers X You, H Yang, K Lei, Z Luan, D Qian Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 3 | 2023 |
Vectorizing spmv by exploiting dynamic regular patterns X You, C Liu, H Yang, P Wang, Z Luan, D Qian Proceedings of the 51st International Conference on Parallel Processing, 1-12, 2022 | 3 | 2022 |
Accelerating the cryo-EM structure determination in RELION on GPU cluster X You, H Yang, Z Luan, D Qian Frontiers of Computer Science 16, 1-19, 2022 | 3 | 2022 |
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding S Wang, H Yang, X Wang, T Liu, P Wang, X Liang, K Ma, T Feng, X You, ... arXiv preprint arXiv:2402.15678, 2024 | 2 | 2024 |
PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling X You, H Yang, Z Xuan, Z Luan, D Qian 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 2 | 2022 |
Accelerating De Novo Assembler WTDBG2 on Commodity Servers M Dun, Y Li, X You, Q Sun, Z Luan, H Yang International Conference on Algorithms and Architectures for Parallel …, 2020 | 2 | 2020 |
Performance analysis and optimization of cyro-em structure determination in relion-2 X You, H Yang, Z Luan, D Qian Conference on Advanced Computer Architecture, 195-209, 2018 | 2 | 2018 |
TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling X You, H Yang, K Lei, Z Luan, D Qian Proceedings of the International Conference for High Performance Computing …, 2023 | 1 | 2023 |
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies T Feng, S Chen, X You, S Zhong, H Yang, Z Luan, D Qian Network and Parallel Computing: 18th IFIP WG 10.3 International Conference …, 2022 | 1 | 2022 |
swGBDT: Efficient Gradient Boosted Decision Tree on Sunway Many-Core Processor B Yin, Y Li, M Dun, X You, H Yang, Z Luan, D Qian Asian Conference on Supercomputing Frontiers, 67-86, 2020 | 1 | 2020 |
L-dag: Enabling loopy workflow in scientific application with automatic dag transformation X You, H Yang, Z Luan, D Qian 2019 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf …, 2019 | 1 | 2019 |
swCaffe: A parallel framework for accelerating deep learning applications on sunway TaihuLight J Fang, L Li, H Fu, J Jiang, W Zhao, C He, X You, G Yang arXiv preprint arXiv:1903.06934, 2019 | 1 | 2019 |
AtRec: Accelerating Recommendation Model Training on CPUs S Wang, T Feng, H Yang, X You, B Chen, T Liu, Z Luan, D Qian IEEE Transactions on Parallel and Distributed Systems, 2024 | | 2024 |