Interactive conversational head generation

Y Zhu, L Zhang, Z Rong, T Hu, S Liang, Z Ge - arXiv preprint arXiv …, 2024 - arxiv.org

Imagine having a conversation with a socially intelligent agent. It can attentively listen to
your words and offer visual and linguistic feedback promptly. This seamless interaction …

Cited by 1; 2 versions

Learning and Evaluating Human Preferences for Conversational Head Generation

M Zhou, Y Bai, W Zhang, T Yao, T Zhao… - Proceedings of the 31st …, 2023 - dl.acm.org

A reliable and comprehensive evaluation metric that aligns with manual preference
assessments is crucial for conversational head video synthesis methods development …

Cited by 2; 3 versions

Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis

M Liu, Y Li, S Zhai, W Guan, L Nie - Proceedings of the 31st ACM …, 2023 - dl.acm.org

The Vivid Talking Head Video Generation track of the "ACM Multimedia ViCo 2023
Conversational Head Generation Challenge" aims to generate realistic face-to-face …

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation

SJ Park, CW Kim, H Rha, M Kim, J Hong… - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we introduce a novel Face-to-Face spoken dialogue model. It processes audio-
visual speech from user input and generates audio-visual speech as the response, marking …

Cited by 5; 2 versions

Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline

Z Chang, W Hu, Q Yang, S Zheng - Proceedings of the 31st ACM …, 2023 - dl.acm.org

In dyadic speaker-listener interactions, the listener's head reactions, together with the
speaker's head movements, form an important non-verbal semantic expression. The listener …

Cited by 5; 3 versions

DECI: The 2nd Tutorial on Designing Effective Conversational Interfaces

U Gadiraju, K Yadav - Adjunct Proceedings of the 32nd ACM Conference …, 2024 - dl.acm.org

Conversational User Interfaces (CUIs) have been argued to have advantages over
traditional GUIs due to having a more human-like interaction. The growing popularity of …

Improvements on SadTalker-based Approach for ViCo Conversational Head Generation Challenge

W Dai - Proceedings of the 31st ACM International Conference …, 2023 - dl.acm.org

This paper presents our solution in the ACM Multimedia ViCo 2023 Conversational Head
Generation Challenge, which aims to generate vivid face-to-face conversation videos based …

Leveraging WaveNet for Dynamic Listening Head Modeling from Speech

MD Nguyen, HJ Yang, SW Kim, JE Shin… - arXiv preprint arXiv …, 2024 - arxiv.org

The creation of listener facial responses aims to simulate interactive communication
feedback from a listener during a face-to-face conversation. Our goal is to generate …

Generation of Listener's Facial Response Using Cross-Modal Mapping of Speaker's Expression

A Fujii, K Fukuda - International Conference on Human-Computer …, 2024 - Springer

In human communication, we use non-verbal cues such as facial expressions to convey
intentions and emotional states. These non-verbal elements play important roles in …

Decoupling Upper and Lower Face Transformers for Binary Interactive Video Generation

D Yang, Y Liu, Q Yang, R Li - Available at SSRN 5060281 - papers.ssrn.com

The current audio-driven binary interactive methods overlook the uncertain relationship
between the speaker audio and the dialogue facial movements. To address this issue, we …