Cross-image attention for zero-shot appearance transfer
Recent advancements in text-to-image generative models have demonstrated a remarkable
ability to capture a deep semantic understanding of images. In this work, we leverage this …
ability to capture a deep semantic understanding of images. In this work, we leverage this …
Surgicalsam: Efficient class promptable surgical instrument segmentation
The Segment Anything Model (SAM) is a powerful foundation model that has revolutionised
image segmentation. To apply SAM to surgical instrument segmentation, a common …
image segmentation. To apply SAM to surgical instrument segmentation, a common …
Alignsam: Aligning segment anything model to open context via reinforcement learning
Powered by massive curated training data Segment Anything Model (SAM) has
demonstrated its impressive generalization capabilities in open-world scenarios with the …
demonstrated its impressive generalization capabilities in open-world scenarios with the …
Conform: Contrast is all you need for high-fidelity text-to-image diffusion models
Images produced by text-to-image diffusion models might not always faithfully represent the
semantic intent of the provided text prompt where the model might overlook or entirely fail to …
semantic intent of the provided text prompt where the model might overlook or entirely fail to …
Initno: Boosting text-to-image diffusion models via initial noise optimization
Recent strides in the development of diffusion models exemplified by advancements such as
Stable Diffusion have underscored their remarkable prowess in generating visually …
Stable Diffusion have underscored their remarkable prowess in generating visually …
R&b: Region and boundary aware zero-shot grounded text-to-image generation
Recent text-to-image (T2I) diffusion models have achieved remarkable progress in
generating high-quality images given text-prompts as input. However, these models fail to …
generating high-quality images given text-prompts as input. However, these models fail to …
Migc++: Advanced multi-instance generation controller for image synthesis
We introduce the Multi-Instance Generation (MIG) task, which focuses on generating
multiple instances within a single image, each accurately placed at predefined positions with …
multiple instances within a single image, each accurately placed at predefined positions with …
Quantum circuit synthesis with diffusion models
Quantum computing has recently emerged as a transformative technology. Yet, its promised
advantages rely on efficiently translating quantum operations into viable physical …
advantages rely on efficiently translating quantum operations into viable physical …
Diffusion for natural image matting
Existing natural image matting algorithms inevitably have flaws in their predictions on
difficult cases, and their one-step prediction manner cannot further correct these errors. In …
difficult cases, and their one-step prediction manner cannot further correct these errors. In …