SadTalker: Learning realistic 3D motion coefficients for stylized audio-driven single image talking face animation

W Zhang, X Cun, X Wang, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Generating talking head videos through a face image and a piece of speech audio still
contains many challenges, i.e., unnatural head movement, distorted expression, and identity …

Conditional image-to-video generation with latent flow diffusion models

H Ni, C Shi, K Li, SX Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (e.g., a person's face) and a condition (e.g., an action class label like …

Animate anyone: Consistent and controllable image-to-video synthesis for character animation

L Hu - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Character animation aims to generate character videos from still images through driving
signals. Currently, diffusion models have become the mainstream in visual generation …

MagicAnimate: Temporally consistent human image animation using diffusion model

Z Xu, J Zhang, JH Liew, H Yan, JW Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper studies the human image animation task which aims to generate a video of a
certain reference identity following a particular motion sequence. Existing animation works …

Generative image dynamics

Z Li, R Tucker, N Snavely… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present an approach to modeling an image-space prior on scene motion. Our prior is
learned from a collection of motion trajectories extracted from real video sequences …

Follow your pose: Pose-guided text-to-video generation using pose-free videos

Y Ma, Y He, X Cun, X Wang, S Chen, X Li… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Generating text-editable and pose-controllable character videos is in high demand
for creating various digital humans. Nevertheless, this task has been restricted by the absence …

One-stage 3D whole-body mesh recovery with component aware transformer

J Lin, A Zeng, H Wang, L Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Whole-body mesh recovery aims to estimate the 3D human body, face, and hands
parameters from a single image. It is challenging to perform this task with a single network …

Thin-plate spline motion model for image animation

J Zhao, H Zhang - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Image animation brings life to the static object in the source image according to the driving
video. Recent works attempt to perform motion transfer on arbitrary objects through …

StyleHEAT: One-shot high-resolution editable talking face generation via pre-trained StyleGAN

F Yin, Y Zhang, X Cun, M Cao, Y Fan, X Wang… - European conference on …, 2022 - Springer
One-shot talking face generation aims at synthesizing a high-quality talking face video from
an arbitrary portrait image, driven by a video or an audio segment. In this work, we provide a …

Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com
Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and it is very difficult to differentiate between what is …