INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations
Y Zhu, L Zhang, Z Rong, T Hu, S Liang, Z Ge - arXiv preprint arXiv …, 2024 - arxiv.org
Imagine having a conversation with a socially intelligent agent. It can attentively listen to
your words and offer visual and linguistic feedback promptly. This seamless interaction …
your words and offer visual and linguistic feedback promptly. This seamless interaction …
Learning and Evaluating Human Preferences for Conversational Head Generation
A reliable and comprehensive evaluation metric that aligns with manual preference
assessments is crucial for conversational head video synthesis methods development …
assessments is crucial for conversational head video synthesis methods development …
Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis
The Vivid Talking Head Video Generation track of the" ACM Multimedia ViCo 2023
Conversational Head Generation Challenge''aims to generate realistic face-to-face …
Conversational Head Generation Challenge''aims to generate realistic face-to-face …
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
In this paper, we introduce a novel Face-to-Face spoken dialogue model. It processes audio-
visual speech from user input and generates audio-visual speech as the response, marking …
visual speech from user input and generates audio-visual speech as the response, marking …
Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline
In dyadic speaker-listener interactions, the listener's head reactions, together with the
speaker's head movements, form an important non-verbal semantic expression. The listener …
speaker's head movements, form an important non-verbal semantic expression. The listener …
DECI: The 2nd Tutorial on Designing Effective Conversational Interfaces
U Gadiraju, K Yadav - Adjunct Proceedings of the 32nd ACM Conference …, 2024 - dl.acm.org
Conversational User Interfaces (CUIs) have been argued to have advantages over
traditional GUIs due to having a more human-like interaction. The growing popularity of …
traditional GUIs due to having a more human-like interaction. The growing popularity of …
Improvements on SadTalker-based Approach for ViCo Conversational Head Generation Challenge
W Dai - Proceedings of the 31st ACM International Conference …, 2023 - dl.acm.org
This paper presents our solution in the ACM Multimedia ViCo 2023 Conversational Head
Generation Challenge, which aims to generate vivid face-to-face conversation videos based …
Generation Challenge, which aims to generate vivid face-to-face conversation videos based …
Leveraging WaveNet for Dynamic Listening Head Modeling from Speech
The creation of listener facial responses aims to simulate interactive communication
feedback from a listener during a face-to-face conversation. Our goal is to generate …
feedback from a listener during a face-to-face conversation. Our goal is to generate …
Generation of Listener's Facial Response Using Cross-Modal Mapping of Speaker's Expression
A Fujii, K Fukuda - International Conference on Human-Computer …, 2024 - Springer
In human communication, we use non-verbal cues such as facial expressions to convey
intentions and emotional states. These non-verbal elements play important roles in …
intentions and emotional states. These non-verbal elements play important roles in …
Decoupling Upper and Lower Face Transformers for Binary Interactive Video Generation
D Yang, Y Liu, Q Yang, R Li - Available at SSRN 5060281 - papers.ssrn.com
The current audio-driven binary interactive methods overlook the uncertain relationship
between the speaker audio and the Dialogue facial movements. To address this issue, we …
between the speaker audio and the Dialogue facial movements. To address this issue, we …