Deep learning for visual speech analysis: A survey
Visual speech, referring to the visual domain of speech, has attracted increasing attention
due to its wide applications, such as public security, medical treatment, military defense, and …
due to its wide applications, such as public security, medical treatment, military defense, and …
Sadtalker: Learning realistic 3d motion coefficients for stylized audio-driven single image talking face animation
Generating talking head videos through a face image and a piece of speech audio still
contains many challenges. ie, unnatural head movement, distorted expression, and identity …
contains many challenges. ie, unnatural head movement, distorted expression, and identity …
Metaportrait: Identity-preserving talking head generation with fast personalized adaptation
In this work, we propose an ID-preserving talking head generation framework, which
advances previous methods in two aspects. First, as opposed to interpolating from sparse …
advances previous methods in two aspects. First, as opposed to interpolating from sparse …
Dpe: Disentanglement of pose and expression for general video portrait editing
One-shot video-driven talking face generation aims at producing a synthetic talking video by
transferring the facial motion from a video to an arbitrary portrait image. Head pose and …
transferring the facial motion from a video to an arbitrary portrait image. Head pose and …
Diffposetalk: Speech-driven stylistic 3d facial animation and head pose generation via diffusion models
The generation of stylistic 3D facial animations driven by speech presents a significant
challenge as it requires learning a many-to-many mapping between speech, style, and the …
challenge as it requires learning a many-to-many mapping between speech, style, and the …
Application of a 3D Talking Head as Part of Telecommunication AR, VR, MR System: Systematic Review
In today's digital era, the realms of virtual reality (VR), augmented reality (AR), and mixed
reality (MR) collectively referred to as extended reality (XR) are reshaping human–computer …
reality (MR) collectively referred to as extended reality (XR) are reshaping human–computer …
Diffsheg: A diffusion-based approach for real-time speech-driven holistic 3d expression and gesture generation
Abstract We propose DiffSHEG a Diffusion-based approach for Speech-driven Holistic 3D
Expression and Gesture generation. While previous works focused on co-speech gesture or …
Expression and Gesture generation. While previous works focused on co-speech gesture or …
Emotional speech-driven animation with content-emotion disentanglement
To be widely adopted, 3D facial avatars must be animated easily, realistically, and directly
from speech signals. While the best recent methods generate 3D animations that are …
from speech signals. While the best recent methods generate 3D animations that are …
Facetalk: Audio-driven motion diffusion for neural parametric head models
We introduce FaceTalk a novel generative approach designed for synthesizing high-fidelity
3D motion sequences of talking human heads from input audio signal. To capture the …
3D motion sequences of talking human heads from input audio signal. To capture the …
ToonTalker: Cross-domain face reenactment
We target cross-domain face reenactment in this paper, ie, driving a cartoon image with the
video of a real person and vice versa. Recently, many works have focused on one-shot …
video of a real person and vice versa. Recently, many works have focused on one-shot …