Debateqa: Evaluating question answering on debatable knowledge
The rise of large language models (LLMs) has enabled us to seek answers to inherently
debatable questions on LLM chatbots, necessitating a reliable way to evaluate their ability …
debatable questions on LLM chatbots, necessitating a reliable way to evaluate their ability …