关注
Shaoyi Huang
Shaoyi Huang
Assistant Professor, Stevens Institute of Technology
在 stevens.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Accelerating transformer-based deep learning models on fpgas using column balanced block pruning
H Peng, S Huang, T Geng, A Li, W Jiang, H Liu, S Wang, C Ding
2021 22nd International Symposium on Quality Electronic Design (ISQED), 142-148, 2021
962021
A length adaptive algorithm-hardware co-design of transformer on fpga through sparse attention and dynamic pipelining
H Peng*, S Huang*, S Chen, B Li, T Geng, A Li, W Jiang, W Wen, J Bi, ...
Proceedings of the 59th ACM/IEEE Design Automation Conference, 1135-1140, 2022
492022
Accommodating transformer onto fpga: Coupling the balanced model compression and fpga-implementation optimization
P Qi, Y Song, H Peng, S Huang, Q Zhuge, EHM Sha
Proceedings of the 2021 on Great Lakes Symposium on VLSI, 163-168, 2021
482021
Accelerating framework of transformer by hardware design and model compression co-optimization
P Qi, EHM Sha, Q Zhuge, H Peng, S Huang, Z Kong, Y Song, B Li
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021
442021
Et: re-thinking self-attention for transformer models on gpus
S Chen*, S Huang*, S Pandey, B Li, GR Gao, L Zheng, C Ding, H Liu
Proceedings of the international conference for high performance computing …, 2021
362021
Accel-gcn: High-performance gpu accelerator design for graph convolution networks
X Xie, H Peng, A Hasan, S Huang, J Zhao, H Fang, W Zhang, T Geng, ...
2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 01-09, 2023
292023
Autorep: Automatic relu replacement for fast private network inference
H Peng*, S Huang*, T Zhou*, Y Luo, C Wang, Z Wang, J Zhao, X Xie, A Li, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
292023
Sparse progressive distillation: Resolving overfitting under pretrain-and-finetune paradigm
S Huang, D Xu, IEH Yen, S Chang, B Li, S Chen, M Xie, H Liu, C Ding
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2021
292021
Maxk-gnn: Extremely fast gpu kernel design for accelerating graph neural networks training
H Peng, X Xie, K Shivdikar, MA Hasan, J Zhao, S Huang, O Khan, D Kaeli, ...
Proceedings of the 29th ACM International Conference on Architectural …, 2024
282024
Towards sparsification of graph neural networks
H Peng, D Gurevin, S Huang, T Geng, W Jiang, O Khan, C Ding
40th IEEE International Conference on Computer Design (ICCD), 2022
282022
Lingcn: Structural linearized graph convolutional network for homomorphically encrypted inference
H Peng, R Ran, Y Luo, J Zhao, S Huang, K Thorat, T Geng, C Wang, X Xu, ...
Advances in Neural Information Processing Systems 36, 2024
232024
An automatic and efficient BERT pruning for edge AI systems
S Huang, N Liu, Y Liang, H Peng, H Li, D Xu, M Xie, C Ding
2022 23rd International Symposium on Quality Electronic Design (ISQED), 1-6, 2022
152022
Rrnet: Towards relu-reduced neural network for two-party computation based private inference
H Peng, S Zhou, Y Luo, N Xu, S Duan, R Ran, J Zhao, S Huang, X Xie, ...
arXiv preprint arXiv:2302.02292, 2023
142023
HMC-TRAN A Tensor-core Inspired Hierarchical Model Compression for Transformer-based DNNs on GPU
S Huang, S Chen, H Peng, D Manu, Z Kong, G Yuan, L Yang, S Wang, ...
Proceedings of the 2021 on Great Lakes Symposium on VLSI, 169-174, 2021
14*2021
Dynamic sparse training via balancing the exploration-exploitation trade-off
S Huang, B Lei, D Xu, H Peng, Y Sun, M Xie, C Ding
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023
132023
Codg-reram: An algorithm-hardware co-design to accelerate semi-structured gnns on reram
Y Luo, P Behnam, K Thorat, Z Liu, H Peng, S Huang, S Zhou, O Khan, ...
2022 IEEE 40th International Conference on Computer Design (ICCD), 280-289, 2022
132022
Co-exploration of graph neural network and network-on-chip design using automl
D Manu, S Huang, C Ding, L Yang
Proceedings of the 2021 on Great Lakes Symposium on VLSI, 175-180, 2021
112021
MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training
H Peng, X Xie, K Shivdikar, MD Hasan, J Zhao, S Huang, O Khan, D Kaeli, ...
arXiv preprint arXiv:2312.08656, 2023
82023
Analyzing and defending against membership inference attacks in natural language processing classification
Y Wang, N Xu, S Huang, K Mahmood, D Guo, C Ding, W Wen, ...
2022 IEEE International Conference on Big Data (Big Data), 5823-5832, 2022
82022
Neurogenesis dynamics-inspired spiking neural network training acceleration
S Huang, H Fang, K Mahmood, B Lei, N Xu, B Lei, Y Sun, D Xu, W Wen, ...
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023
62023
系统目前无法执行此操作,请稍后再试。
文章 1–20