Pre-trained models for natural language processing: A survey X Qiu, T Sun, Y Xu, Y Shao, N Dai, X Huang Science China technological sciences 63 (10), 1872-1897, 2020 | 1639 | 2020 |
Black-Box Tuning for Language-Model-as-a-Service T Sun, Y Shao, H Qian, X Huang, X Qiu ICML 2022, 2022 | 190 | 2022 |
CoLAKE: Contextualized Language and Knowledge Embedding T Sun, Y Shao, X Qiu, Q Guo, Y Hu, X Huang, Z Zhang COLING 2020, 2020 | 175 | 2020 |
Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa J Dai, H Yan, T Sun, P Liu, X Qiu NAACL 2021, 2021 | 148 | 2021 |
Learning sparse sharing architectures for multiple tasks T Sun, Y Shao, X Li, P Liu, H Yan, X Qiu, X Huang AAAI 2020, 2020 | 140 | 2020 |
Paradigm shift in natural language processing T Sun, X Liu, X Qiu, X Huang Machine Intelligence Research 19 (3), 169–183, 2022 | 98 | 2022 |
BBTv2: Towards a Gradient-Free Future with Large Language Models T Sun, Z He, H Qian, Y Zhou, X Huang, X Qiu EMNLP 2022, 2022 | 69* | 2022 |
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models Z He*, T Sun*, K Wang, X Huang, X Qiu ACL 2023, 2022 | 64 | 2022 |
Secrets of rlhf in large language models part i: Ppo R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023 | 53 | 2023 |
MOSS: An Open Conversational Large Language Model T Sun, X Zhang, Z He, P Li, Q Cheng, H Yan, X Liu, Y Shao, Q Tang, ... Machine Intelligence Research, 2024 | 47* | 2024 |
Towards Efficient NLP: A Standard Evaluation and A Strong Baseline X Liu*, T Sun*, J He, L Wu, X Zhang, H Jiang, Z Cao, X Huang, X Qiu NAACL 2022, 2021 | 39 | 2021 |
CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors P Li*, T Sun*, Q Tang, H Yan, Y Wu, X Huang, X Qiu ACL 2023, 2023 | 36 | 2023 |
BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation T Sun, J He, X Qiu, X Huang EMNLP 2022, 2022 | 35 | 2022 |
A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation T Sun, X Liu, W Zhu, Z Geng, L Wu, Y He, Y Ni, G Xie, X Huang, X Qiu ACL 2022 (findings), 2022 | 33 | 2022 |
Accelerating BERT Inference for Sequence Labeling via Early-Exit X Li, Y Shao, T Sun, H Yan, X Qiu, X Huang ACL 2021, 2021 | 29 | 2021 |
Improving Contrastive Learning of Sentence Embeddings from AI Feedback Q Cheng, X Yang, T Sun, L Li, X Qiu ACL 2023 (findings), 2023 | 28 | 2023 |
Early exiting with ensemble internal classifiers T Sun, Y Zhou, X Liu, X Zhang, H Jiang, Z Cao, X Huang, X Qiu arXiv preprint arXiv:2105.13792, 2021 | 28 | 2021 |
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling J Zhan, J Dai, J Ye, Y Zhou, D Zhang, Z Liu, X Zhang, R Yuan, G Zhang, ... ACL 2024, 2024 | 21 | 2024 |
Evaluating hallucinations in chinese large language models Q Cheng, T Sun, W Zhang, S Wang, X Liu, M Zhang, J He, M Huang, Z Yin, ... arXiv preprint arXiv:2310.03368, 2023 | 19 | 2023 |
Multitask Pre-training of Modular Prompt for Chinese Few-Shot Learning T Sun, Z He, Q Zhu, X Qiu, X Huang ACL 2023, 2022 | 19* | 2022 |