A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

In-context learning unlocked for diffusion models

Z Wang, Y Jiang, Y Lu, P He, W Chen… - Advances in …, 2023 - proceedings.neurips.cc
Abstract We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …

Prompt-free diffusion: Taking" text" out of text-to-image diffusion models

X Xu, J Guo, Z Wang, G Huang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Text-to-image (T2I) research has grown explosively in the past year owing to the
large-scale pre-trained diffusion models and many emerging personalization and editing …

Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Instancediffusion: Instance-level control for image generation

X Wang, T Darrell, SS Rambhatla… - Proceedings of the …, 2024 - openaccess.thecvf.com
Text-to-image diffusion models produce high quality images but do not offer control over
individual instances in the image. We introduce InstanceDiffusion that adds precise instance …

Compositional abilities emerge multiplicatively: Exploring diffusion models on a synthetic task

M Okawa, ES Lubana, R Dick… - Advances in Neural …, 2024 - proceedings.neurips.cc
Modern generative models exhibit unprecedented capabilities to generate extremely
realistic data. However, given the inherent compositionality of real world, reliable use of …

Smooth diffusion: Crafting smooth latent spaces in diffusion models

J Guo, X Xu, Y Pu, Z Ni, C Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently diffusion models have made remarkable progress in text-to-image (T2I) generation
synthesizing images with high fidelity and diverse contents. Despite this advancement latent …

Zone: Zero-shot instruction-guided local editing

S Li, B Zeng, Y Feng, S Gao, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advances in vision-language models like Stable Diffusion have shown remarkable
power in creative image synthesis and editing. However most existing text-to-image editing …

Dginstyle: Domain-generalizable semantic segmentation with image diffusion models and stylized semantic control

Y Jia, L Hoyer, S Huang, T Wang, L Van Gool… - … on Computer Vision, 2025 - Springer
Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability
to generate creative content, specialize to user data through few-shot fine-tuning, and …

Place: Adaptive layout-semantic fusion for semantic image synthesis

Z Lv, Y Wei, W Zuo, KYK Wong - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent advancements in large-scale pre-trained text-to-image models have led to
remarkable progress in semantic image synthesis. Nevertheless synthesizing high-quality …