Deepfakes generation and detection: a short survey

Z Akhtar - Journal of Imaging, 2023 - mdpi.com
Advancements in deep learning techniques and the availability of free, large databases
have made it possible, even for non-technical people, to either manipulate or generate …

Learning in audio-visual context: A review, analysis, and new perspective

Y Wei, D Hu, Y Tian, X Li - arXiv preprint arXiv:2208.09579, 2022 - arxiv.org
Sight and hearing are two senses that play a vital role in human communication and scene
understanding. To mimic human perception ability, audio-visual learning, aimed at …

Pose-controllable talking face generation by implicitly modularized audio-visual representation

H Zhou, Y Sun, W Wu, CC Loy… - Proceedings of the …, 2021 - openaccess.thecvf.com
While accurate lip synchronization has been achieved for arbitrary-subject audio-driven
talking face generation, the problem of how to efficiently drive the head pose remains …

Expressive talking head generation with granular audio-visual control

B Liang, Y Pan, Z Guo, H Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com
Generating expressive talking heads is essential for creating virtual humans. However,
existing one-or few-shot methods focus on lip-sync and head motion, ignoring the emotional …

Eamm: One-shot emotional talking face via audio-based emotion-aware motion model

X Ji, H Zhou, K Wang, Q Wu, W Wu, F Xu… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
Although significant progress has been made to audio-driven talking face generation,
existing methods either neglect facial emotion or cannot be applied to arbitrary subjects. In …

Audio2head: Audio-driven one-shot talking-head generation with natural head motion

S Wang, L Li, Y Ding, C Fan, X Yu - arXiv preprint arXiv:2107.09293, 2021 - arxiv.org
We propose an audio-driven talking-head method to generate photo-realistic talking-head
videos from a single reference image. In this work, we tackle two key challenges:(i) …

Synface: Face recognition with synthetic data

H Qiu, B Yu, D Gong, Z Li, W Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com
With the recent success of deep neural networks, remarkable progress has been achieved
on face recognition. However, collecting large-scale real-world training data for face …

High-fidelity and freely controllable talking head video generation

Y Gao, Y Zhou, J Wang, X Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Talking head generation is to generate video based on a given source identity and target
motion. However, current methods face several challenges that limit the quality and …

Hyperreenact: one-shot reenactment via jointly learning to refine and retarget faces

S Bounareli, C Tzelepis, V Argyriou… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we present our method for neural face reenactment, called HyperReenact, that
aims to generate realistic talking head images of a source identity, driven by a target facial …

Portraitbooth: A versatile portrait model for fast identity-preserved personalization

X Peng, J Zhu, B Jiang, Y Tai, D Luo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advancements in personalized image generation using diffusion models have been
noteworthy. However existing methods suffer from inefficiencies due to the requirement for …