CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems
In this work, we dive deep into one of the popular knowledge-grounded dialogue
benchmarks that focus on faithfulness, FaithDial. We show that a significant portion of the …
benchmarks that focus on faithfulness, FaithDial. We show that a significant portion of the …
Prompt Perturbation Consistency Learning for Robust Language Models
Large language models (LLMs) have demonstrated impressive performance on a number of
natural language processing tasks, such as question answering and text summarization …
natural language processing tasks, such as question answering and text summarization …