Sadtalker: Learning realistic 3d motion coefficients for stylized audio-driven single image talking face animation
Generating talking head videos through a face image and a piece of speech audio still
contains many challenges. ie, unnatural head movement, distorted expression, and identity …
contains many challenges. ie, unnatural head movement, distorted expression, and identity …
Next3d: Generative neural texture rasterization for 3d-aware head avatars
Abstract 3D-aware generative adversarial networks (GANs) synthesize high-fidelity and
multi-view-consistent facial images using only collections of single-view 2D imagery …
multi-view-consistent facial images using only collections of single-view 2D imagery …
Self-supervised learning of adversarial example: Towards good generalizations for deepfake detection
Recent studies in deepfake detection have yielded promising results when the training and
testing face forgeries are from the same dataset. However, the problem remains challenging …
testing face forgeries are from the same dataset. However, the problem remains challenging …
[HTML][HTML] Talking human face generation: A survey
Talking human face generation aims at synthesizing a natural human face that talks in
correspondence to the given text or audio series. Implementing the recently developed …
correspondence to the given text or audio series. Implementing the recently developed …
Diffused heads: Diffusion models beat gans on talking-face generation
M Stypułkowski, K Vougioukas, S He… - Proceedings of the …, 2024 - openaccess.thecvf.com
Talking face generation has historically struggled to produce head movements and natural
facial expressions without guidance from additional reference videos. Recent developments …
facial expressions without guidance from additional reference videos. Recent developments …
Otavatar: One-shot talking face avatar with controllable tri-plane rendering
Controllability, generalizability and efficiency are the major objectives of constructing face
avatars represented by neural implicit field. However, existing methods have not managed …
avatars represented by neural implicit field. However, existing methods have not managed …
Progressive disentangled representation learning for fine-grained controllable talking head synthesis
We present a novel one-shot talking head synthesis method that achieves disentangled and
fine-grained control over lip motion, eye gaze&blink, head pose, and emotional expression …
fine-grained control over lip motion, eye gaze&blink, head pose, and emotional expression …
Stylesync: High-fidelity generalized and personalized lip sync in style-based generator
Despite recent advances in syncing lip movements with any audio waves, current methods
still struggle to balance generation quality and the model's generalization ability. Previous …
still struggle to balance generation quality and the model's generalization ability. Previous …
Metaportrait: Identity-preserving talking head generation with fast personalized adaptation
In this work, we propose an ID-preserving talking head generation framework, which
advances previous methods in two aspects. First, as opposed to interpolating from sparse …
advances previous methods in two aspects. First, as opposed to interpolating from sparse …
Styletalk: One-shot talking head generation with controllable speaking styles
Different people speak with diverse personalized speaking styles. Although existing one-
shot talking head methods have made significant progress in lip sync, natural facial …
shot talking head methods have made significant progress in lip sync, natural facial …