Dreamix: Video diffusion models are general video editors

E Molad, E Horwitz, D Valevski, AR Acha… - arXiv preprint arXiv …, 2023 - arxiv.org
Text-driven image and video diffusion models have recently achieved unprecedented
generation realism. While diffusion models have been successfully applied for image …

CelebV-HQ: A large-scale video facial attributes dataset

H Zhu, W Wu, W Zhu, L Jiang, S Tang, L Zhang… - European conference on …, 2022 - Springer
Large-scale datasets have played indispensable roles in the recent success of face
generation/editing and significantly facilitated the advances of emerging research fields …

VToonify: Controllable high-resolution portrait video style transfer

S Yang, L Jiang, Z Liu, CC Loy - ACM Transactions on Graphics (TOG), 2022 - dl.acm.org
Generating high-quality artistic portrait videos is an important and desirable task in computer
graphics and vision. Although a series of successful portrait image toonification models built …

State‐of‐the‐Art in the Architecture, Methods and Applications of StyleGAN

AH Bermano, R Gal, Y Alaluf, R Mokady… - Computer Graphics …, 2022 - Wiley Online Library
Generative Adversarial Networks (GANs) have established themselves as a
prevalent approach to image synthesis. Of these, StyleGAN offers a fascinating case study …

Style transformer for image inversion and editing

X Hu, Q Huang, Z Shi, S Li, C Gao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Existing GAN inversion methods fail to provide codes for reliable reconstruction and flexible
editing simultaneously. This paper presents a transformer-based image inversion and …

Stitch it in time: GAN-based facial editing of real videos

R Tzaban, R Mokady, R Gal, A Bermano… - SIGGRAPH Asia 2022 …, 2022 - dl.acm.org
The ability of Generative Adversarial Networks to encode rich semantics within their latent
space has been widely adopted for facial image editing. However, replicating their success …

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

StyleT2I: Toward compositional and high-fidelity text-to-image synthesis

Z Li, MR Min, K Li, C Xu - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Although progress has been made for text-to-image synthesis, previous methods fall short of
generalizing to unseen or underrepresented attribute compositions in the input text. Lacking …

Third time's the charm? Image and video editing with StyleGAN3

Y Alaluf, O Patashnik, Z Wu, A Zamir… - … on Computer Vision, 2022 - Springer
StyleGAN is arguably one of the most intriguing and well-studied generative models,
demonstrating impressive performance in image generation, inversion, and manipulation. In …

Fine-grained face swapping via regional GAN inversion

Z Liu, M Li, Y Zhang, C Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a novel paradigm for high-fidelity face swapping that faithfully preserves the
desired subtle geometry and texture details. We rethink face swapping from the perspective …