State of the art on monocular 3D face reconstruction, tracking, and applications

M Zollhöfer, J Thies, P Garrido, D Bradley… - Computer graphics …, 2018 - Wiley Online Library
The computer graphics and vision communities have dedicated long standing efforts in
building computerized tools for reconstructing, tracking, and analyzing human faces based …

Gaussian head avatar: Ultra high-fidelity head avatar via dynamic gaussians

Y Xu, B Chen, Z Li, H Zhang, L Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Creating high-fidelity 3D head avatars has always been a research hotspot but there
remains a great challenge under lightweight sparse view setups. In this paper we propose …

Codetalker: Speech-driven 3d facial animation with discrete motion prior

J Xing, M Xia, Y Zhang, X Cun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Speech-driven 3D facial animation has been widely studied, yet there is still a gap to
achieving realism and vividness due to the highly ill-posed nature and scarcity of audio …

Im avatar: Implicit morphable head avatars from videos

Y Zheng, VF Abrevaya, MC Bühler… - Proceedings of the …, 2022 - openaccess.thecvf.com
Traditional 3D morphable face models (3DMMs) provide fine-grained control over
expression but cannot easily capture geometric and appearance details. Neural volumetric …

[HTML][HTML] Survey on 3D face reconstruction from uncalibrated images

A Morales, G Piella, FM Sukno - Computer Science Review, 2021 - Elsevier
Recently, a lot of attention has been focused on the incorporation of 3D data into face
analysis and its applications. Despite providing a more accurate representation of the face …

Reconstructing personalized semantic facial nerf models from monocular video

X Gao, C Zhong, J Xiang, Y Hong, Y Guo… - ACM Transactions on …, 2022 - dl.acm.org
We present a novel semantic model for human head defined with neural radiance field. The
3D-consistent head model consist of a set of disentangled and interpretable bases, and can …

Live speech portraits: real-time photorealistic talking-head animation

Y Lu, J Chai, X Cao - ACM Transactions on Graphics (ToG), 2021 - dl.acm.org
To the best of our knowledge, we first present a live system that generates personalized
photorealistic talking-head animation only driven by audio signals at over 30 fps. Our system …

Stylerig: Rigging stylegan for 3d control over portrait images

A Tewari, M Elgharib, G Bharaj… - Proceedings of the …, 2020 - openaccess.thecvf.com
StyleGAN generates photorealistic portrait images of faces with eyes, teeth, hair and context
(neck, shoulders, background), but lacks a rig-like control over semantic face parameters …

Deep video portraits

H Kim, P Garrido, A Tewari, W Xu, J Thies… - ACM transactions on …, 2018 - dl.acm.org
We present a novel approach that enables photo-realistic re-animation of portrait videos
using only an input video. In contrast to existing approaches that are restricted to …

Synthesizing obama: learning lip sync from audio

S Suwajanakorn, SM Seitz… - ACM Transactions on …, 2017 - dl.acm.org
Given audio of President Barack Obama, we synthesize a high quality video of him speaking
with accurate lip sync, composited into a target video clip. Trained on many hours of his …