A comprehensive capability analysis of gpt-3 and gpt-3.5 series models J Ye, X Chen, N Xu, C Zu, Z Shao, S Liu, Y Cui, Z Zhou, C Gong, Y Shen, ... arXiv preprint arXiv:2303.10420, 2023 | 135 | 2023 |
Llmeval: A preliminary study on how to evaluate large language models Y Zhang, M Zhang, H Yuan, S Liu, Y Shi, T Gui, Q Zhang, X Huang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 19615 …, 2024 | 4 | 2024 |
Training large language models for reasoning through reverse curriculum reinforcement learning Z Xi, W Chen, B Hong, S Jin, R Zheng, W He, Y Ding, S Liu, X Guo, ... arXiv preprint arXiv:2402.05808, 2024 | 3 | 2024 |
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models W He, S Liu, J Zhao, Y Ding, Y Lu, Z Xi, T Gui, Q Zhang, X Huang arXiv preprint arXiv:2404.00884, 2024 | | 2024 |