INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations

Y Zhu, L Zhang, Z Rong, T Hu, S Liang, Z Ge - arXiv preprint arXiv …, 2024 - arxiv.org
Imagine having a conversation with a socially intelligent agent. It can attentively listen to
your words and offer visual and linguistic feedback promptly. This seamless interaction …

Learning and Evaluating Human Preferences for Conversational Head Generation

M Zhou, Y Bai, W Zhang, T Yao, T Zhao… - Proceedings of the 31st …, 2023 - dl.acm.org
A reliable and comprehensive evaluation metric that aligns with manual preference
assessments is crucial for conversational head video synthesis methods development …

Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis

M Liu, Y Li, S Zhai, W Guan, L Nie - Proceedings of the 31st ACM …, 2023 - dl.acm.org
The Vivid Talking Head Video Generation track of the" ACM Multimedia ViCo 2023
Conversational Head Generation Challenge''aims to generate realistic face-to-face …

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation

SJ Park, CW Kim, H Rha, M Kim, J Hong… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce a novel Face-to-Face spoken dialogue model. It processes audio-
visual speech from user input and generates audio-visual speech as the response, marking …

Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline

Z Chang, W Hu, Q Yang, S Zheng - Proceedings of the 31st ACM …, 2023 - dl.acm.org
In dyadic speaker-listener interactions, the listener's head reactions, together with the
speaker's head movements, form an important non-verbal semantic expression. The listener …

DECI: The 2nd Tutorial on Designing Effective Conversational Interfaces

U Gadiraju, K Yadav - Adjunct Proceedings of the 32nd ACM Conference …, 2024 - dl.acm.org
Conversational User Interfaces (CUIs) have been argued to have advantages over
traditional GUIs due to having a more human-like interaction. The growing popularity of …

Improvements on SadTalker-based Approach for ViCo Conversational Head Generation Challenge

W Dai - Proceedings of the 31st ACM International Conference …, 2023 - dl.acm.org
This paper presents our solution in the ACM Multimedia ViCo 2023 Conversational Head
Generation Challenge, which aims to generate vivid face-to-face conversation videos based …

Leveraging WaveNet for Dynamic Listening Head Modeling from Speech

MD Nguyen, HJ Yang, SW Kim, JE Shin… - arXiv preprint arXiv …, 2024 - arxiv.org
The creation of listener facial responses aims to simulate interactive communication
feedback from a listener during a face-to-face conversation. Our goal is to generate …

Generation of Listener's Facial Response Using Cross-Modal Mapping of Speaker's Expression

A Fujii, K Fukuda - International Conference on Human-Computer …, 2024 - Springer
In human communication, we use non-verbal cues such as facial expressions to convey
intentions and emotional states. These non-verbal elements play important roles in …

Decoupling Upper and Lower Face Transformers for Binary Interactive Video Generation

D Yang, Y Liu, Q Yang, R Li - Available at SSRN 5060281 - papers.ssrn.com
The current audio-driven binary interactive methods overlook the uncertain relationship
between the speaker audio and the Dialogue facial movements. To address this issue, we …