A tale of two features: Stable Diffusion complements DINO for zero-shot semantic correspondence
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …
In-context learning unlocked for diffusion models
We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …
Prompt-free diffusion: Taking "text" out of text-to-image diffusion models
Text-to-image (T2I) research has grown explosively in the past year owing to the
large-scale pre-trained diffusion models and many emerging personalization and editing …
Diffusion model-based image editing: A survey
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …
InstanceDiffusion: Instance-level control for image generation
Text-to-image diffusion models produce high-quality images but do not offer control over
individual instances in the image. We introduce InstanceDiffusion that adds precise instance …
Compositional abilities emerge multiplicatively: Exploring diffusion models on a synthetic task
Modern generative models exhibit unprecedented capabilities to generate extremely
realistic data. However, given the inherent compositionality of the real world, reliable use of …
Smooth diffusion: Crafting smooth latent spaces in diffusion models
Recently, diffusion models have made remarkable progress in text-to-image (T2I) generation,
synthesizing images with high fidelity and diverse contents. Despite this advancement, latent …
ZONE: Zero-shot instruction-guided local editing
Recent advances in vision-language models like Stable Diffusion have shown remarkable
power in creative image synthesis and editing. However, most existing text-to-image editing …
DGInStyle: Domain-generalizable semantic segmentation with image diffusion models and stylized semantic control
Large, pretrained latent diffusion models (LDMs) have demonstrated an extraordinary ability
to generate creative content, specialize to user data through few-shot fine-tuning, and …
PLACE: Adaptive layout-semantic fusion for semantic image synthesis
Recent advancements in large-scale pre-trained text-to-image models have led to
remarkable progress in semantic image synthesis. Nevertheless, synthesizing high-quality …