Active Testing of Large Language Model via Multi-Stage Sampling
Performance evaluation plays a crucial role in the development life cycle of large language
models (LLMs). It estimates the model's capability, elucidates behavior characteristics, and …
models (LLMs). It estimates the model's capability, elucidates behavior characteristics, and …