- Academic resource search

SadTalker: Learning realistic 3D motion coefficients for stylized audio-driven single image talking face animation

W Zhang, X Cun, X Wang, Y Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Generating talking head videos through a face image and a piece of speech audio still
contains many challenges, i.e., unnatural head movement, distorted expression, and identity …

Cited by 151 · All 7 versions

Conditional image-to-video generation with latent flow diffusion models

H Ni, C Shi, K Li, SX Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (e.g., a person's face) and a condition (e.g., an action class label like …

Cited by 98 · All 6 versions

Animate anyone: Consistent and controllable image-to-video synthesis for character animation

L Hu - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Character Animation aims to generate character videos from still images through driving
signals. Currently, diffusion models have become the mainstream in visual generation …

Cited by 105 · All 3 versions

MagicAnimate: Temporally consistent human image animation using diffusion model

Z Xu, J Zhang, JH Liew, H Yan, JW Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper studies the human image animation task which aims to generate a video of a
certain reference identity following a particular motion sequence. Existing animation works …

Cited by 67 · All 3 versions

Generative image dynamics

Z Li, R Tucker, N Snavely… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

We present an approach to modeling an image-space prior on scene motion. Our prior is
learned from a collection of motion trajectories extracted from real video sequences …

Cited by 34 · All 9 versions

Follow your pose: Pose-guided text-to-video generation using pose-free videos

Y Ma, Y He, X Cun, X Wang, S Chen, X Li… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Generating text-editable and pose-controllable character videos has an imperious demand
in creating various digital humans. Nevertheless, this task has been restricted by the absence …

Cited by 86 · All 3 versions

One-stage 3D whole-body mesh recovery with component aware transformer

J Lin, A Zeng, H Wang, L Zhang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Whole-body mesh recovery aims to estimate the 3D human body, face, and hands
parameters from a single image. It is challenging to perform this task with a single network …

Cited by 65 · All 6 versions

Thin-plate spline motion model for image animation

J Zhao, H Zhang - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

Image animation brings life to the static object in the source image according to the driving
video. Recent works attempt to perform motion transfer on arbitrary objects through …

Cited by 132 · All 5 versions

StyleHEAT: One-shot high-resolution editable talking face generation via pre-trained StyleGAN

F Yin, Y Zhang, X Cun, M Cao, Y Fan, X Wang… - European conference on …, 2022 - Springer

One-shot talking face generation aims at synthesizing a high-quality talking face video from
an arbitrary portrait image, driven by a video or an audio segment. In this work, we provide a …

Cited by 138 · All 6 versions

Deepfakes as a threat to a speaker and facial recognition: An overview of tools and attack vectors

A Firc, K Malinka, P Hanáček - Heliyon, 2023 - cell.com

Deepfakes present an emerging threat in cyberspace. Recent developments in machine
learning make deepfakes highly believable, and very difficult to differentiate between what is …

Cited by 18 · All 7 versions