A survey on evaluation of large language models Y Chang, X Wang, J Wang, Y Wu, K Zhu, H Chen, L Yang, X Yi, C Wang, ... ACM TIST, 2024 | 846 | 2024 |
Flexmatch: Boosting semi-supervised learning with curriculum pseudo labeling B Zhang*, Y Wang*(Equal Contribution), W Hou, H Wu, J Wang, ... NeurIPS 2021, 2021 | 714 | 2021 |
Freematch: Self-adaptive thresholding for semi-supervised learning Y Wang, H Chen, Q Heng, W Hou, Y Fan, Z Wu, J Wang, M Savvides, ... ICLR 2023, 2023 | 188 | 2023 |
On the robustness of chatgpt: An adversarial and out-of-distribution perspective J Wang, X Hu, W Hou, H Chen, R Zheng, Y Wang, L Yang, H Huang, ... ICLR 2023 workshop on reliable large model, 2023 | 174 | 2023 |
Promptbench: Towards evaluating the robustness of large language models on adversarial prompts K Zhu, J Wang, J Zhou, Z Wang, H Chen, Y Wang, L Yang, W Ye, Y Zhang, ... arXiv preprint arXiv:2306.04528, 2023 | 148 | 2023 |
Softmatch: Addressing the quantity-quality trade-off in semi-supervised learning H Chen, R Tao, Y Fan, Y Wang, J Wang, B Schiele, X Xie, B Raj, ... ICLR 2023, 2023 | 109 | 2023 |
PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, J Wang, ... ICLR 2024, 2024 | 100 | 2024 |
Usb: A unified semi-supervised learning benchmark for classification Y Wang, H Chen, Y Fan, W Sun, R Tao, W Hou, R Wang, L Yang, Z Zhou, ... NuerIPS 2022, 2022 | 94 | 2022 |
Survey on factuality in large language models: Knowledge, retrieval and domain-specificity C Wang, X Liu, Y Yue, X Tang, T Zhang, C Jiayang, Y Yao, W Gao, X Hu, ... arXiv preprint arXiv:2310.07521, 2023 | 93 | 2023 |
Exploiting adapters for cross-lingual low-resource speech recognition W Hou, H Zhu, Y Wang, J Wang, T Qin, R Xu, T Shinozaki TASLP, 2021 | 54 | 2021 |
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective L Yang, S Zhang, L Qin, Y Li, Y Wang, H Liu, J Wang, X Xie, Y Zhang ACL Findings 2023, 2023 | 52 | 2023 |
Conv-adapter: Exploring parameter efficient transfer learning for convnets H Chen, R Tao, H Zhang, Y Wang, W Ye, J Wang, G Hu, M Savvides CVPR 2024 workshop, 2022 | 36 | 2022 |
Evaluating open-qa evaluation C Wang, S Cheng, Q Guo, Y Yue, B Ding, Z Xu, Y Wang, X Hu, Z Zhang, ... NeurIPS 2023, 2023 | 29* | 2023 |
Meta-adapter: Efficient cross-lingual adaptation with meta-learning W Hou, Y Wang, S Gao, T Shinozaki ICASSP 2021, 2021 | 26 | 2021 |
Margin calibration for long-tailed visual recognition Y Wang, B Zhang, W Hou, Z Wu, J Wang, T Shinozaki ACML 2022, 2022 | 22 | 2022 |
Exploring vision-language models for imbalanced learning Y Wang, Z Yu, J Wang, Q Heng, H Chen, W Ye, R Xie, X Xie, S Zhang International Journal of Computer Vision 132 (1), 224-237, 2023 | 18 | 2023 |
Out-of-Distribution Generalization in Natural Language Processing: Past, Present, and Future L Yang, Y Song, X Ren, C Lyu, Y Wang, J Zhuo, L Liu, J Wang, J Foster, ... EMNLP 2023, 2023 | 9* | 2023 |
KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models Z Yu, C Gao, W Yao, Y Wang, W Ye, J Wang, X Xie, Y Zhang, S Zhang arXiv preprint arXiv:2402.15043, 2024 | 7 | 2024 |
Towards Optimization and Model Selection for Domain Generalization: A Mixup-guided Solution W Lu, J Wang, Y Wang, X Xie SDM 2024, 2024 | 7 | 2024 |
Imprecise label learning: A unified framework for learning with various imprecise label configurations H Chen, A Shah, J Wang, R Tao, Y Wang, X Xie, M Sugiyama, R Singh, ... arXiv preprint arXiv:2305.12715, 2023 | 7 | 2023 |