Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks | X Xie, H Peng, A Hasan, S Huang, J Zhao, H Fang, W Zhang, T Geng, ... | 2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 01-09, 2023 | 29 | 2023 |
AutoReP: Automatic ReLU Replacement for Fast Private Network Inference | H Peng, S Huang, T Zhou, Y Luo, C Wang, Z Wang, J Zhao, X Xie, A Li, ... | Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 29 | 2023 |
MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training | H Peng, X Xie, K Shivdikar, MA Hasan, J Zhao, S Huang, O Khan, D Kaeli, ... | Proceedings of the 29th ACM International Conference on Architectural …, 2024 | 28 | 2024 |
RRNet: Towards ReLU-Reduced Neural Network for Two-Party Computation Based Private Inference | H Peng, S Zhou, Y Luo, N Xu, S Duan, R Ran, J Zhao, S Huang, X Xie, ... | arXiv preprint arXiv:2302.02292, 2023 | 14 | 2023 |
AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing | T Zhou, J Zhao, Y Luo, X Xie, W Wen, C Ding, X Xu | arXiv preprint arXiv:2407.05633, 2024 | 9 | 2024 |
MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training | H Peng, X Xie, K Shivdikar, MD Hasan, J Zhao, S Huang, O Khan, D Kaeli, ... | arXiv preprint arXiv:2312.08656, 2023 | 8 | 2023 |
Advanced Language Model-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis | K Thorat, J Zhao, Y Liu, H Peng, X Xie, B Lei, J Zhang, C Ding | arXiv preprint arXiv:2312.01022, 2023 | 6 | 2023 |
Advanced Large Language Model (LLM)-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis | K Thorat, J Zhao, Y Liu, H Peng, X Xie, B Lei, J Zhang, C Ding | CoRR, 2023 | 4 | 2023 |
RTop-K: Ultra-Fast Row-Wise Top-K Algorithm and GPU Implementation for Neural Networks | X Xie, Y Luo, H Peng, C Ding | arXiv preprint arXiv:2409.00822, 2024 | | 2024 |