CAR: conceptualization-augmented reasoner for zero-shot commonsense question answering

W Wang, T Fang, W Ding, B Xu, X Liu, Y Song… - arXiv preprint arXiv …, 2023 - arxiv.org
The task of zero-shot commonsense question answering evaluates models on their capacity
to reason about general scenarios beyond those presented in specific datasets. Existing …

Acquiring and modeling abstract commonsense knowledge via conceptualization

M He, T Fang, W Wang, Y Song - Artificial Intelligence, 2024 - Elsevier
Conceptualization, or viewing entities and situations as instances of abstract concepts in
mind and making inferences based on that, is a vital component in human intelligence for …

How far are we from believable AI agents? A framework for evaluating the believability of human behavior simulation

Y Xiao, Y Cheng, J Fu, J Wang, W Li, P Liu - arXiv preprint arXiv …, 2023 - arxiv.org
Human behavior simulation of AI agents necessitates the agents to possess a quality of
believability, which is crucial as it facilitates users in establishing trust toward the agents and …

Evaluating very long-term conversational memory of llm agents

A Maharana, DH Lee, S Tulyakov, M Bansal… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing works on long-term open-domain dialogues focus on evaluating model responses
within contexts spanning no more than five chat sessions. Despite advancements in long …

Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

YJ Lee, D Lee, J Youn, K Oh, B Ko, J Hyeon… - arXiv preprint arXiv …, 2024 - arxiv.org
Humans share a wide variety of images related to their personal experiences within
conversations via instant messaging tools. However, existing works focus on (1) image …

Large Language Models for Human-Robot Interaction: Opportunities and Risks

J Atuhurra - arXiv preprint arXiv:2405.00693, 2024 - arxiv.org
The tremendous development in large language models (LLM) has led to a new wave of
innovations and applications and yielded research results that were initially forecast to take …

Optimizing training data for persona-grounded dialogue via Synthetic Label Augmentation

C Lee, D Kim, W Kim, K Lee, Y Ahn, KH Lee… - Expert Systems with …, 2024 - Elsevier
Persona-grounded dialogue systems aim to enhance the quality of AI agent responses by
bolstering persona consistency and promoting response diversity. Although model tuning …

ComperDial: Commonsense Persona-grounded Dialogue Dataset and Benchmark

H Wakaki, Y Mitsufuji, Y Maeda, Y Nishimura… - arXiv preprint arXiv …, 2024 - arxiv.org
We propose a new benchmark, ComperDial, which facilitates the training and evaluation of
evaluation metrics for open-domain dialogue systems. ComperDial consists of human …

DiffuCOMET: Contextual Commonsense Knowledge Diffusion

S Gao, M Ismayilzada, M Zhao, H Wakaki… - arXiv preprint arXiv …, 2024 - arxiv.org
Inferring contextually-relevant and diverse commonsense to understand narratives remains
challenging for knowledge models. In this work, we develop a series of knowledge models …

PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models

J Kim, S Koo, HS Lim - Proceedings of the 2024 Conference on …, 2024 - aclanthology.org
In the persona-grounded dialogue (PGD) task, it is required not only to respond fluently, but
also to ground the attributes according to the current conversation topic properly. However …