Xin You
Verified email at buaa.edu.cn
Title
Cited by
Year
The deep learning compiler: A comprehensive survey
M Li, Y Liu, X Liu, Q Sun, X You, H Yang, Z Luan, L Gan, G Yang, D Qian
IEEE Transactions on Parallel and Distributed Systems 32 (3), 708-727, 2020
Cited by: 196; Year: 2020
swcaffe: A parallel framework for accelerating deep learning applications on sunway taihulight
L Li, J Fang, H Fu, J Jiang, W Zhao, C He, X You, G Yang
2018 IEEE International Conference on Cluster Computing (CLUSTER), 413-422, 2018
Cited by: 34; Year: 2018
Performance evaluation and analysis of linear algebra kernels in the prototype tianhe-3 cluster
X You, H Yang, Z Luan, Y Liu, D Qian
Asian Conference on Supercomputing Frontiers, 86-105, 2019
Cited by: 21; Year: 2019
Automatic code generation and optimization of large-scale stencil computation on many-core processors
M Li, Y Liu, H Yang, Y Hu, Q Sun, B Chen, X You, X Liu, Z Luan, D Qian
Proceedings of the 50th International Conference on Parallel Processing, 1-12, 2021
Cited by: 15; Year: 2021
ZeroSpy: exploring software inefficiency with redundant zeros
X You, H Yang, Z Luan, D Qian, X Liu
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by: 10; Year: 2020
DRStencil: Exploiting data reuse within low-order stencil on GPU
X You, H Yang, Z Jiang, Z Luan, D Qian
2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th …, 2021
Cited by: 6; Year: 2021
Towards GPU acceleration of phonon computation with ShengBTE
Y Wei, X You, H Yang, Z Luan, D Qian
Proceedings of the International Conference on High Performance Computing in …, 2020
Cited by: 4; Year: 2020
Vclinic: A portable and efficient framework for fine-grained value profilers
X You, H Yang, K Lei, Z Luan, D Qian
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 3; Year: 2023
Vectorizing spmv by exploiting dynamic regular patterns
X You, C Liu, H Yang, P Wang, Z Luan, D Qian
Proceedings of the 51st International Conference on Parallel Processing, 1-12, 2022
Cited by: 3; Year: 2022
Accelerating the cryo-EM structure determination in RELION on GPU cluster
X You, H Yang, Z Luan, D Qian
Frontiers of Computer Science 16, 1-19, 2022
Cited by: 3; Year: 2022
Minions: Accelerating Large Language Model Inference with Adaptive and Collective Speculative Decoding
S Wang, H Yang, X Wang, T Liu, P Wang, X Liang, K Ma, T Feng, X You, ...
arXiv preprint arXiv:2402.15678, 2024
Cited by: 2; Year: 2024
PowerSpector: Towards Energy Efficiency with Calling-Context-Aware Profiling
X You, H Yang, Z Xuan, Z Luan, D Qian
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022
Cited by: 2; Year: 2022
Accelerating De Novo Assembler WTDBG2 on Commodity Servers
M Dun, Y Li, X You, Q Sun, Z Luan, H Yang
International Conference on Algorithms and Architectures for Parallel …, 2020
Cited by: 2; Year: 2020
Performance analysis and optimization of cyro-em structure determination in relion-2
X You, H Yang, Z Luan, D Qian
Conference on Advanced Computer Architecture, 195-209, 2018
Cited by: 2; Year: 2018
TrivialSpy: Identifying Software Triviality via Fine-grained and Dataflow-based Value Profiling
X You, H Yang, K Lei, Z Luan, D Qian
Proceedings of the International Conference for High Performance Computing …, 2023
Cited by: 1; Year: 2023
dgQuEST: Accelerating Large Scale Quantum Circuit Simulation through Hybrid CPU-GPU Memory Hierarchies
T Feng, S Chen, X You, S Zhong, H Yang, Z Luan, D Qian
Network and Parallel Computing: 18th IFIP WG 10.3 International Conference …, 2022
Cited by: 1; Year: 2022
swGBDT: Efficient Gradient Boosted Decision Tree on Sunway Many-Core Processor
B Yin, Y Li, M Dun, X You, H Yang, Z Luan, D Qian
Asian Conference on Supercomputing Frontiers, 67-86, 2020
Cited by: 1; Year: 2020
L-dag: Enabling loopy workflow in scientific application with automatic dag transformation
X You, H Yang, Z Luan, D Qian
2019 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf …, 2019
Cited by: 1; Year: 2019
swCaffe: A parallel framework for accelerating deep learning applications on sunway TaihuLight
J Fang, L Li, H Fu, J Jiang, W Zhao, C He, X You, G Yang
arXiv preprint arXiv:1903.06934, 2019
Cited by: 1; Year: 2019
AtRec: Accelerating Recommendation Model Training on CPUs
S Wang, T Feng, H Yang, X You, B Chen, T Liu, Z Luan, D Qian
IEEE Transactions on Parallel and Distributed Systems, 2024
Year: 2024