CAR: conceptualization-augmented reasoner for zero-shot commonsense question answering
The task of zero-shot commonsense question answering evaluates models on their capacity
to reason about general scenarios beyond those presented in specific datasets. Existing …
to reason about general scenarios beyond those presented in specific datasets. Existing …
Acquiring and modeling abstract commonsense knowledge via conceptualization
Conceptualization, or viewing entities and situations as instances of abstract concepts in
mind and making inferences based on that, is a vital component in human intelligence for …
mind and making inferences based on that, is a vital component in human intelligence for …
How far are we from believable AI agents? A framework for evaluating the believability of human behavior simulation
Human behavior simulation of AI agents necessitates the agents to possess a quality of
believability, which is crucial as it facilitates users in establishing trust toward the agents and …
believability, which is crucial as it facilitates users in establishing trust toward the agents and …
Evaluating very long-term conversational memory of llm agents
Existing works on long-term open-domain dialogues focus on evaluating model responses
within contexts spanning no more than five chat sessions. Despite advancements in long …
within contexts spanning no more than five chat sessions. Despite advancements in long …
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
Humans share a wide variety of images related to their personal experiences within
conversations via instant messaging tools. However, existing works focus on (1) image …
conversations via instant messaging tools. However, existing works focus on (1) image …
Large Language Models for Human-Robot Interaction: Opportunities and Risks
J Atuhurra - arXiv preprint arXiv:2405.00693, 2024 - arxiv.org
The tremendous development in large language models (LLM) has led to a new wave of
innovations and applications and yielded research results that were initially forecast to take …
innovations and applications and yielded research results that were initially forecast to take …
Optimizing training data for persona-grounded dialogue via Synthetic Label Augmentation
Persona-grounded dialogue systems aim to enhance the quality of AI agent responses by
bolstering persona consistency and promoting response diversity. Although model tuning …
bolstering persona consistency and promoting response diversity. Although model tuning …
ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark
H Wakaki, Y Mitsufuji, Y Maeda, Y Nishimura… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose a new benchmark, ComperDial, which facilitates the training and evaluation of
evaluation metrics for open-domain dialogue systems. ComperDial consists of human …
evaluation metrics for open-domain dialogue systems. ComperDial consists of human …
DiffuCOMET: Contextual Commonsense Knowledge Diffusion
Inferring contextually-relevant and diverse commonsense to understand narratives remains
challenging for knowledge models. In this work, we develop a series of knowledge models …
challenging for knowledge models. In this work, we develop a series of knowledge models …
PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models
In the persona-grounded dialogue (PGD) task, it is required not only to respond fluently, but
also to ground the attributes according to the current conversation topic properly. However …
also to ground the attributes according to the current conversation topic properly. However …