- Academic resource search

Active Testing of Large Language Model via Multi-Stage Sampling

Y Huang, J Song, Q Hu, F Juefei-Xu, L Ma - arXiv preprint arXiv …, 2024 - arxiv.org

Performance evaluation plays a crucial role in the development life cycle of large language
models (LLMs). It estimates the model's capability, elucidates behavior characteristics, and …
