Expressive speech-driven facial animation

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com

Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Cited by 92 · Related articles · All 8 versions

AD-NeRF: Audio driven neural radiance fields for talking head synthesis

Y Guo, K Chen, S Liang, YJ Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com

Generating high-fidelity talking head video by fitting with the input audio sequence is a
challenging problem that receives considerable attentions recently. In this paper, we …

Cited by 327 · Related articles · All 7 versions

EmoTalk: Speech-driven emotional disentanglement for 3D face animation

Z Peng, H Wu, Z Song, H Xu, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Speech-driven 3D face animation aims to generate realistic facial expressions that match
the speech content and emotion. However, existing methods often neglect emotional facial …

Cited by 58 · Related articles · All 5 versions

MEAD: A large-scale audio-visual dataset for emotional talking-face generation

K Wang, Q Wu, L Song, Z Yang, W Wu, C Qian… - … on Computer Vision, 2020 - Springer

The synthesis of natural emotional reactions is an essential criterion in vivid talking-face
video generation. This criterion is nevertheless seldom taken into consideration in previous …

Cited by 247 · Related articles · All 2 versions

MeshTalk: 3D face animation from speech using cross-modality disentanglement

A Richard, M Zollhöfer, Y Wen… - Proceedings of the …, 2021 - openaccess.thecvf.com

This paper presents a generic method for generating full facial 3D animation from speech.
Existing approaches to audio-driven facial animation exhibit uncanny or static upper face …

Cited by 177 · Related articles · All 6 versions

Synthesizing Obama: learning lip sync from audio

S Suwajanakorn, SM Seitz… - ACM Transactions on …, 2017 - dl.acm.org

Given audio of President Barack Obama, we synthesize a high quality video of him speaking
with accurate lip sync, composited into a target video clip. Trained on many hours of his …

Cited by 1270 · Related articles · All 3 versions

Audio-driven facial animation by joint end-to-end learning of pose and emotion

T Karras, T Aila, S Laine, A Herva… - ACM Transactions on …, 2017 - dl.acm.org

We present a machine learning technique for driving 3D facial animation by audio input in
real time and with low latency. Our deep neural network learns a mapping from input …

Cited by 449 · Related articles · All 6 versions

Realistic speech-driven facial animation with GANs

K Vougioukas, S Petridis, M Pantic - International Journal of Computer …, 2020 - Springer

Speech-driven facial animation is the process that automatically synthesizes talking
characters based on speech signals. The majority of work in this domain creates a mapping …

Cited by 275 · Related articles · All 10 versions

A deep learning approach for generalized speech animation

S Taylor, T Kim, Y Yue, M Mahler, J Krahe… - ACM Transactions On …, 2017 - dl.acm.org

We introduce a simple and effective deep learning approach to automatically generate
natural looking speech animation that synchronizes to input speech. Our approach uses a …

Cited by 325 · Related articles · All 18 versions

Capture, learning, and synthesis of 3D speaking styles

D Cudeiro, T Bolkart, C Laidlaw… - Proceedings of the …, 2019 - openaccess.thecvf.com

Audio-driven 3D facial animation has been widely explored, but achieving realistic, human-like
performance is still unsolved. This is due to the lack of available 3D datasets, models …

Cited by 336 · Related articles · All 11 versions