A taxonomy of social cues for conversational agents
Conversational agents (CAs) are software-based systems designed to interact with humans
using natural language and have attracted considerable research interest in recent years …
A Comprehensive Review of Data‐Driven Co‐Speech Gesture Generation
Gestures that accompany speech are an essential part of natural and efficient embodied
human communication. The automatic generation of such co‐speech gestures is a long …
Taming diffusion models for audio-driven co-speech gesture generation
Animating virtual avatars to make co-speech gestures facilitates various applications in
human-machine interaction. The existing methods mainly rely on generative adversarial …
GestureDiffuCLIP: Gesture diffusion model with CLIP latents
T Ao, Z Zhang, L Liu - ACM Transactions on Graphics (TOG), 2023 - dl.acm.org
The automatic generation of stylized co-speech gestures has recently received increasing
attention. Previous systems typically allow style control via predefined text labels or example …
Learning hierarchical cross-modal association for co-speech gesture generation
Generating speech-consistent body and gesture movements is a long-standing problem in
virtual avatar creation. Previous studies often synthesize pose movement in a holistic …
Learning individual styles of conversational gesture
Human speech is often accompanied by hand and arm gestures. We present a method for
cross-modal translation from "in-the-wild" monologue speech of a single speaker to their …
Learning to listen: Modeling non-deterministic dyadic facial motion
We present a framework for modeling interactional communication in dyadic conversations:
given multimodal inputs of a speaker, we autoregressively output multiple possibilities of …
Can language models learn to listen?
E Ng, S Subramanian, D Klein… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a framework for generating appropriate facial responses from a listener in
dyadic social interactions based on the speaker's words. Given an input transcription of the …
LivelySpeaker: Towards semantic-aware co-speech gesture generation
Gestures are non-verbal but important behaviors accompanying people's speech. While
previous methods are able to generate speech rhythm-synchronized gestures, the semantic …
From audio to photoreal embodiment: Synthesizing humans in conversations
We present a framework for generating full-bodied photorealistic avatars that gesture
according to the conversational dynamics of a dyadic interaction. Given speech audio we …