Outlier suppression: Pushing the limit of low-bit transformer language models X Wei, Y Zhang, X Zhang, R Gong, S Zhang, Q Zhang, F Yu, X Liu Advances in Neural Information Processing Systems 35, 17402-17414, 2022 | 79 | 2022 |
Outlier suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling X Wei, Y Zhang, Y Li, X Zhang, R Gong, J Guo, X Liu arXiv preprint arXiv:2304.09145, 2023 | 61 | 2023 |
Teacherlm: Teaching to fish rather than giving the fish, language modeling likewise N He, H Lai, C Zhao, Z Cheng, J Pan, R Qin, R Lu, R Lu, Y Zhang, G Zhao, ... arXiv preprint arXiv:2310.19019, 2023 | 6 | 2023 |
Better few-shot text classification with pre-trained language model Z Chen, Y Zhang Artificial Neural Networks and Machine Learning–ICANN 2021: 30th …, 2021 | 5 | 2021 |
Do Prompts Solve NLP Tasks Using Natural Language? S Yang, Y Zhang, L Cui, Y Zhang arXiv preprint arXiv:2203.00902, 2022 | 3 | 2022 |
Exploring the potential of flexible 8-bit format: Design and algorithm Z Zhang, Y Zhang, G Shi, Y Shen, X Wei, R Gong, X Xia, Q Zhang, L Lu, ... arXiv preprint arXiv:2310.13513, 2023 | 2 | 2023 |
LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models R Gong, Y Yong, S Gu, Y Huang, Y Zhang, X Liu, D Tao arXiv preprint arXiv:2405.06001, 2024 | 1 | 2024 |
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency Y Wang, Y Li, R Gong, A Liu, J Hu, Y Yao, Y Zhang, F Yu, X Liu Proceedings of Machine Learning and Systems 5, 2023 | | 2023 |