Efficient Large Language Models: A Survey Z Wan, X Wang, C Liu, S Alam, Y Zheng, Z Qu, S Yan, Y Zhu, Q Zhang, ... TMLR 2024, 2023 | 69 | 2023 |
IoT in the Era of Generative AI: Vision and Challenges X Wang, Z Wan, A Hekmati, M Zong, S Alam, M Zhang, B Krishnamachari IEEE Internet Computing 2024, 2024 | 15 | 2024 |
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation Z Wan, C Liu, X Wang, C Tao, H Shen, Z Peng, J Fu, R Arcucci, H Yao, ... arXiv preprint arXiv:2403.04945, 2024 | 7 | 2024 |
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression X Wang, Y Zheng, Z Wan, M Zhang arXiv preprint arXiv:2403.07378, 2024 | 6 | 2024 |
Data Stream Clustering: An In-depth Empirical Study X Wang, Z Wang, Z Wu, S Zhang, X Shi, L Lu SIGMOD 2023, 2023 | 5 | 2023 |
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models Z Wan, X Wu, Y Zhang, Y Xin, C Tao, Z Zhu, X Wang, S Luo, J Xiong, ... arXiv preprint arXiv:2406.13035, 2024 | 2 | 2024 |
Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion H Shen, Z Wan, X Wang, M Zhang ECCV 2024 Workshop on Computational Aspects of Deep Learning (Best Paper …, 2024 | | 2024 |
Benne: A Modular and Self-Optimizing Algorithm for Data Stream Clustering Z Wang, X Wang, S Zhang ICDM 2024, 2023 | | 2023 |