- 学术资源搜索

Dreamix: Video diffusion models are general video editors

E Molad, E Horwitz, D Valevski, AR Acha… - arXiv preprint arXiv …, 2023 - arxiv.org

Text-driven image and video diffusion models have recently achieved unprecedented
generation realism. While diffusion models have been successfully applied for image …

被引用次数：172 相关文章所有 3 个版本

[PDF] arxiv.org

CelebV-HQ: A large-scale video facial attributes dataset

H Zhu, W Wu, W Zhu, L Jiang, S Tang, L Zhang… - European conference on …, 2022 - Springer

Large-scale datasets have played indispensable roles in the recent success of face
generation/editing and significantly facilitated the advances of emerging research fields …

被引用次数：115 相关文章所有 6 个版本

[PDF] arxiv.org

Vtoonify: Controllable high-resolution portrait video style transfer

S Yang, L Jiang, Z Liu, CC Loy - ACM Transactions on Graphics (TOG), 2022 - dl.acm.org

Generating high-quality artistic portrait videos is an important and desirable task in computer
graphics and vision. Although a series of successful portrait image toonification models built …

被引用次数：65 相关文章所有 4 个版本

[PDF] arxiv.org

State‐of‐the‐Art in the Architecture, Methods and Applications of StyleGAN

AH Bermano, R Gal, Y Alaluf, R Mokady… - Computer Graphics …, 2022 - Wiley Online Library

Abstract Generative Adversarial Networks (GANs) have established themselves as a
prevalent approach to image synthesis. Of these, StyleGAN offers a fascinating case study …

被引用次数：86 相关文章所有 7 个版本

[PDF] thecvf.com

Style transformer for image inversion and editing

X Hu, Q Huang, Z Shi, S Li, C Gao… - Proceedings of the …, 2022 - openaccess.thecvf.com

Existing GAN inversion methods fail to provide codes for reliable reconstruction and flexible
editing simultaneously. This paper presents a transformer-based image inversion and …

被引用次数：61 相关文章所有 6 个版本

Stitch it in time: Gan-based facial editing of real videos

R Tzaban, R Mokady, R Gal, A Bermano… - SIGGRAPH Asia 2022 …, 2022 - dl.acm.org

The ability of Generative Adversarial Networks to encode rich semantics within their latent
space has been widely adopted for facial image editing. However, replicating their success …

被引用次数：81 相关文章所有 3 个版本

[PDF] arxiv.org

Deepfake generation and detection: A benchmark and survey

G Pei, J Zhang, M Hu, Z Zhang, C Wang, Y Wu… - arXiv preprint arXiv …, 2024 - arxiv.org

Deepfake is a technology dedicated to creating highly realistic facial images and videos
under specific conditions, which has significant application potential in fields such as …

被引用次数：24 相关文章所有 2 个版本

[PDF] thecvf.com

Stylet2i: Toward compositional and high-fidelity text-to-image synthesis

Z Li, MR Min, K Li, C Xu - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

Although progress has been made for text-to-image synthesis, previous methods fall short of
generalizing to unseen or underrepresented attribute compositions in the input text. Lacking …

被引用次数：55 相关文章所有 8 个版本

Third time's the charm? image and video editing with stylegan3

Y Alaluf, O Patashnik, Z Wu, A Zamir… - … on Computer Vision, 2022 - Springer

StyleGAN is arguably one of the most intriguing and well-studied generative models,
demonstrating impressive performance in image generation, inversion, and manipulation. In …

被引用次数：78 相关文章所有 4 个版本

[PDF] thecvf.com

Fine-grained face swapping via regional gan inversion

Z Liu, M Li, Y Zhang, C Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a novel paradigm for high-fidelity face swapping that faithfully preserves the
desired subtle geometry and texture details. We rethink face swapping from the perspective …

被引用次数：56 相关文章所有 5 个版本