The rise and potential of large language model based agents: A survey | Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... | arXiv preprint arXiv:2309.07864, 2023 | 665 | 2023 |
Towards understanding the capability of large language models on code clone detection: a survey | S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang | arXiv preprint arXiv:2308.01191, 2023 | 19 | 2023 |
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Z Xi, Y Ding, W Chen, B Hong, H Guo, J Wang, D Yang, C Liao, X Guo, ... | arXiv preprint arXiv:2406.04151, 2024 | 13 | 2024 |
Training large language models for reasoning through reverse curriculum reinforcement learning | Z Xi, W Chen, B Hong, S Jin, R Zheng, W He, Y Ding, S Liu, X Guo, ... | arXiv preprint arXiv:2402.05808, 2024 | 10 | 2024 |
LongHeads: Multi-Head Attention is Secretly a Long Context Processor | Y Lu, X Zhou, W He, J Zhao, T Ji, T Gui, Q Zhang, X Huang | arXiv preprint arXiv:2402.10685, 2024 | 4 | 2024 |
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration | J Zhao, C Zu, H Xu, Y Lu, W He, Y Ding, T Gui, Q Zhang, X Huang | arXiv preprint arXiv:2402.11550, 2024 | 2 | 2024 |
TopicAns: Topic-informed Architecture for Answer Recommendation on Technical Q&A Site | Y Yang, W He, C Gao, Z Xu, X Xia, C Liu | ACM Transactions on Software Engineering and Methodology 33 (1), 1-25, 2023 | 1 | 2023 |
Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision | Z Xi, D Yang, J Huang, J Tang, G Li, Y Ding, W He, B Hong, S Do, W Zhan, ... | arXiv preprint arXiv:2411.16579, 2024 | | 2024 |
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling | Y Ding, Z Xi, W He, Z Li, Y Zhai, X Shi, X Cai, T Gui, Q Zhang, X Huang | arXiv preprint arXiv:2411.00750, 2024 | | 2024 |
Distill Visual Chart Reasoning Ability from LLMs to MLLMs | W He, Z Xi, W Zhao, X Fan, Y Ding, Z Shan, T Gui, Q Zhang, X Huang | arXiv preprint arXiv:2410.18798, 2024 | | 2024 |
Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models | W He, S Liu, J Zhao, Y Ding, Y Lu, Z Xi, T Gui, Q Zhang, X Huang | arXiv preprint arXiv:2404.00884, 2024 | | 2024 |