Benchmarking foundation models with language-model-as-an-examiner Y Bai, J Ying, Y Cao, X Lv, Y He, X Wang, J Yu, K Zeng, Y Xiao, H Lyu, ... Advances in Neural Information Processing Systems 36, 2024 | 60 | 2024 |
Have seen me before? automating dataset updates towards reliable and timely evaluation J Ying, Y Cao, B Wang, W Tang, Y Yang, S Yan arXiv preprint arXiv:2402.11894, 2024 | 3 | 2024 |
A+ B: A General Generator-Reader Framework for Optimizing LLMs to Unleash Synergy Potential W Tang, Y Cao, J Ying, B Wang, Y Zhao, Y Liao, P Zhou arXiv preprint arXiv:2406.03963, 2024 | 2 | 2024 |
Intuitive or Dependent? Investigating LLMs' Robustness to Conflicting Prompts J Ying, Y Cao, K Xiong, Y He, L Cui, Y Liu ACL 2024, 2023 | 2 | 2023 |
LLMs-as-Instructors: Learning from Errors Toward Automating Model Improvement J Ying, M Lin, Y Cao, W Tang, B Wang, Q Sun, X Huang, S Yan arXiv preprint arXiv:2407.00497, 2024 | | 2024 |
QRMeM: Unleash the Length Limitation through Question then Reflection Memory Mechanism B Wang, H Huang, Y Cao, J Ying, W Tang, C Feng arXiv preprint arXiv:2406.13167, 2024 | | 2024 |