Post-training Quantization on Diffusion Models Y Shang, Z Yuan, B Xie, B Wu, Y Yan CVPR, 2023 | 69 | 2023 |
Rptq: Reorder-based post-training quantization for large language models Z Yuan, L Niu, J Liu, W Liu, X Wang, Y Shang, G Sun, Q Wu, J Wu, B Wu arXiv preprint arXiv:2304.01089, 2023 | 49 | 2023 |
Lipschitz Continuity Guided Knowledge Distillation Y Shang, B Duan, Z Zong, L Nie, Y Yan ICCV, 2021 | 31 | 2021 |
Network Binarization via Contrastive Learning Y Shang, X Dan, Z Zong, L Nie, Y Yan ECCV, 2022 | 24 | 2022 |
Pb-llm: Partially binarized large language models Y Shang*, Z Yuan*, Q Wu, Z Dong arXiv preprint arXiv:2310.00034, 2023 | 20 | 2023 |
Lipschitz Continuity Retained Binary Neural Network Y Shang, X Dan, B Duan, Z Zong, L Nie, Y Yan ECCV, 2022 | 19 | 2022 |
LLM Inference Unveiled: Survey and Roofline Model Insights Z Yuan*, Y Shang*, Y Zhou*, Z Dong, C Xue, B Wu, Z Li, Q Gu, YJ Lee, ... arXiv preprint arXiv:2402.16363, 2024 | 12 | 2024 |
ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models Z Yuan*, Y Shang*, Y Song, Q Wu, Y Yan, G Sun arXiv preprint arXiv:2312.05821, 2023 | 10 | 2023 |
Llava-prumerge: Adaptive token reduction for efficient large multimodal models Y Shang, M Cai, B Xu, YJ Lee, Y Yan arXiv preprint arXiv:2403.15388, 2024 | 7 | 2024 |
Quest: Low-bit diffusion model quantization via efficient selective finetuning H Wang, Y Shang, Z Yuan, J Wu, Y Yan arXiv preprint arXiv:2402.03666, 2024 | 3 | 2024 |
MIM4DD: Mutual Information Maximization for Dataset Distillation Y Shang, Z Yuan, Y Yan NeurIPS, 2023 | 3 | 2023 |
Causal-DFQ: Causality Guided Data-free Network Quantization Y Shang, B Xu, G Liu, R Kompella, Y Yan ICCV, 2023 | 3 | 2023 |
Bpt: binary point cloud transformer for place recognition Z Hou, Y Shang, T Gao, Y Yan arXiv preprint arXiv:2303.01166, 2023 | 2 | 2023 |
Supplementing Missing Visions via Dialog for Scene Graph Generations Z Zhao, Y Zhu, X Zhu, Y Shang, Y Yan ICASSP, 2024 | 1 | 2024 |
Network specialization via feature-level knowledge distillation G Liu, Y Shang, Y Yao, R Kompella CVPR-Workshop, 2023 | 1 | 2023 |
Dataset Quantization with Active Learning based Adaptive Sampling Z Zhao, Y Shang, J Wu, Y Yan ECCV, 2024 | | 2024 |
Enhancing Post-training Quantization Calibration through Contrastive Learning Y Shang, G Liu, RR Kompella, Y Yan CVPR, 2024 | | 2024 |
Efficient Multitask Dense Predictor via Binarization Y Shang, D Xu, G Liu, RR Kompella, Y Yan CVPR, 2024 | | 2024 |
A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training K Wang, Y Zhou, M Shi, Z Yuan, Y Shang, X Peng, H Zhang, Y You arXiv preprint arXiv:2405.17403, 2024 | | 2024 |
PTQ4DiT: Post-training Quantization for Diffusion Transformers J Wu, H Wang, Y Shang, M Shah, Y Yan arXiv preprint arXiv:2405.16005, 2024 | | 2024 |