Stylegan-t: Unlocking the power of gans for fast large-scale text-to-image synthesis

A Kebaili, J Lapuyade-Lahorgue, S Ruan - Journal of Imaging, 2023 - mdpi.com

Deep learning has become a popular tool for medical image analysis, but the limited
availability of training data remains a major challenge, particularly in the medical field where …

被引用次数：86 相关文章所有 9 个版本

A review on generative adversarial networks for image generation

VLT De Souza, BAD Marques, HC Batagelo… - Computers & …, 2023 - Elsevier

Abstract Generative Adversarial Networks (GANs) are a type of deep learning architecture
that uses two networks namely a generator and a discriminator that, by competing against …

被引用次数：32 相关文章所有 2 个版本

[PDF] thecvf.com

Scaling up gans for text-to-image synthesis

M Kang, JY Zhu, R Zhang, J Park… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …

被引用次数：329 相关文章所有 6 个版本

[PDF] thecvf.com

Elite: Encoding visual concepts into textual embeddings for customized text-to-image generation

Y Wei, Y Zhang, Z Ji, J Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com

In addition to the unprecedented ability in imaginary creation, large text-to-image models are
expected to take customized concepts in image generation. Existing works generally learn …

被引用次数：183 相关文章所有 5 个版本

[PDF] thecvf.com

Dreambooth3d: Subject-driven text-to-3d generation

A Raj, S Kaza, B Poole, M Niemeyer… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present DreamBooth3D, an approach to personalize text-to-3D generative models from
as few as 3-6 casually captured images of a subject. Our approach combines recent …

被引用次数：149 相关文章所有 5 个版本

[PDF] thecvf.com

Ablating concepts in text-to-image diffusion models

N Kumari, B Zhang, SY Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large-scale text-to-image diffusion models can generate high-fidelity images with powerful
compositional ability. However, these models are typically trained on an enormous amount …

被引用次数：102 相关文章所有 6 个版本

[PDF] arxiv.org

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1-4 steps while …

被引用次数：103 相关文章所有 2 个版本

[PDF] arxiv.org

Controlvideo: Training-free controllable text-to-video generation

Y Zhang, Y Wei, D Jiang, X Zhang, W Zuo… - arXiv preprint arXiv …, 2023 - arxiv.org

Text-driven diffusion models have unlocked unprecedented abilities in image generation,
whereas their video counterpart still lags behind due to the excessive training cost of …

被引用次数：115 相关文章所有 3 个版本

[PDF] thecvf.com

One-step diffusion with distribution matching distillation

T Yin, M Gharbi, R Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Diffusion models generate high-quality images but require dozens of forward passes. We
introduce Distribution Matching Distillation (DMD) a procedure to transform a diffusion model …

被引用次数：35 相关文章所有 3 个版本

[PDF] thecvf.com

Localizing object-level shape variations with text-to-image diffusion models

O Patashnik, D Garibi, I Azuri… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-to-image models give rise to workflows which often begin with an exploration step,
where users sift through a large collection of generated images. The global nature of the text …

被引用次数：63 相关文章所有 5 个版本