Codetalker: Speech-driven 3d facial animation with discrete motion prior

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Ad-nerf: Audio driven neural radiance fields for talking head synthesis

Y Guo, K Chen, S Liang, YJ Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com
Generating high-fidelity talking head video by fitting with the input audio sequence is a
challenging problem that receives considerable attentions recently. In this paper, we …

Emotalk: Speech-driven emotional disentanglement for 3d face animation

Z Peng, H Wu, Z Song, H Xu, X Zhu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D face animation aims to generate realistic facial expressions that match
the speech content and emotion. However, existing methods often neglect emotional facial …

Mead: A large-scale audio-visual dataset for emotional talking-face generation

K Wang, Q Wu, L Song, Z Yang, W Wu, C Qian… - … on Computer Vision, 2020 - Springer
The synthesis of natural emotional reactions is an essential criterion in vivid talking-face
video generation. This criterion is nevertheless seldom taken into consideration in previous …

Meshtalk: 3d face animation from speech using cross-modality disentanglement

A Richard, M Zollhöfer, Y Wen… - Proceedings of the …, 2021 - openaccess.thecvf.com
This paper presents a generic method for generating full facial 3D animation from speech.
Existing approaches to audio-driven facial animation exhibit uncanny or static upper face …

Synthesizing obama: learning lip sync from audio

S Suwajanakorn, SM Seitz… - ACM Transactions on …, 2017 - dl.acm.org
Given audio of President Barack Obama, we synthesize a high quality video of him speaking
with accurate lip sync, composited into a target video clip. Trained on many hours of his …

Audio-driven facial animation by joint end-to-end learning of pose and emotion

T Karras, T Aila, S Laine, A Herva… - ACM Transactions on …, 2017 - dl.acm.org
We present a machine learning technique for driving 3D facial animation by audio input in
real time and with low latency. Our deep neural network learns a mapping from input …

[HTML][HTML] Realistic speech-driven facial animation with gans

K Vougioukas, S Petridis, M Pantic - International Journal of Computer …, 2020 - Springer
Speech-driven facial animation is the process that automatically synthesizes talking
characters based on speech signals. The majority of work in this domain creates a mapping …

A deep learning approach for generalized speech animation

S Taylor, T Kim, Y Yue, M Mahler, J Krahe… - ACM Transactions On …, 2017 - dl.acm.org
We introduce a simple and effective deep learning approach to automatically generate
natural looking speech animation that synchronizes to input speech. Our approach uses a …

Capture, learning, and synthesis of 3D speaking styles

D Cudeiro, T Bolkart, C Laidlaw… - Proceedings of the …, 2019 - openaccess.thecvf.com
Audio-driven 3D facial animation has been widely explored, but achieving realistic, human-
like performance is still unsolved. This is due to the lack of available 3D datasets, models …