Pair-diffusion: Object-level image editing with structure-and-appearance paired diffusion models

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc

Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

被引用次数：123 相关文章所有 8 个版本

[PDF] neurips.cc

In-context learning unlocked for diffusion models

Z Wang, Y Jiang, Y Lu, P He, W Chen… - Advances in …, 2023 - proceedings.neurips.cc

Abstract We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …

被引用次数：53 相关文章所有 6 个版本

[PDF] thecvf.com

Prompt-free diffusion: Taking" text" out of text-to-image diffusion models

X Xu, J Guo, Z Wang, G Huang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Text-to-image (T2I) research has grown explosively in the past year owing to the
large-scale pre-trained diffusion models and many emerging personalization and editing …

被引用次数：47 相关文章所有 4 个版本

[PDF] arxiv.org

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org

Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

被引用次数：61 相关文章所有 2 个版本

[PDF] thecvf.com

Instancediffusion: Instance-level control for image generation

X Wang, T Darrell, SS Rambhatla… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-image diffusion models produce high quality images but do not offer control over
individual instances in the image. We introduce InstanceDiffusion that adds precise instance …

被引用次数：46 相关文章所有 3 个版本

[PDF] neurips.cc

Compositional abilities emerge multiplicatively: Exploring diffusion models on a synthetic task

M Okawa, ES Lubana, R Dick… - Advances in Neural …, 2024 - proceedings.neurips.cc

Modern generative models exhibit unprecedented capabilities to generate extremely
realistic data. However, given the inherent compositionality of real world, reliable use of …

被引用次数：29 相关文章所有 7 个版本

[PDF] thecvf.com

Smooth diffusion: Crafting smooth latent spaces in diffusion models

J Guo, X Xu, Y Pu, Z Ni, C Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recently diffusion models have made remarkable progress in text-to-image (T2I) generation
synthesizing images with high fidelity and diverse contents. Despite this advancement latent …

被引用次数：13 相关文章所有 3 个版本

[PDF] thecvf.com

Zone: Zero-shot instruction-guided local editing

S Li, B Zeng, Y Feng, S Gao, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recent advances in vision-language models like Stable Diffusion have shown remarkable
power in creative image synthesis and editing. However most existing text-to-image editing …

被引用次数：22 相关文章所有 3 个版本

[PDF] openreview.net

Dginstyle: Domain-generalizable semantic segmentation with image diffusion models and stylized semantic control

Y Jia, L Hoyer, S Huang, T Wang, L Van Gool… - … on Computer Vision, 2025 - Springer

Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability
to generate creative content, specialize to user data through few-shot fine-tuning, and …

被引用次数：11 相关文章所有 3 个版本

[PDF] thecvf.com

Place: Adaptive layout-semantic fusion for semantic image synthesis

Z Lv, Y Wei, W Zuo, KYK Wong - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Recent advancements in large-scale pre-trained text-to-image models have led to
remarkable progress in semantic image synthesis. Nevertheless synthesizing high-quality …

被引用次数：7 相关文章所有 4 个版本