Anydoor: Zero-shot object-level image customization
This work presents AnyDoor a diffusion-based image generator with the power to teleport
target objects to new scenes at user-specified locations with desired shapes. Instead of …
target objects to new scenes at user-specified locations with desired shapes. Instead of …
Videocomposer: Compositional video synthesis with motion controllability
The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …
remarkable progress in customizable image synthesis. However, achieving controllable …
Svdiff: Compact parameter space for diffusion fine-tuning
Recently, diffusion models have achieved remarkable success in text-to-image generation,
enabling the creation of high-quality images from text prompts and various conditions …
enabling the creation of high-quality images from text prompts and various conditions …
Uni-controlnet: All-in-one control to text-to-image diffusion models
Text-to-Image diffusion models have made tremendous progress over the past two years,
enabling the generation of highly realistic images based on open-domain text descriptions …
enabling the generation of highly realistic images based on open-domain text descriptions …
Raphael: Text-to-image generation via large mixture of diffusion paths
Text-to-image generation has recently witnessed remarkable achievements. We introduce a
text-conditional image diffusion model, termed RAPHAEL, to generate highly artistic images …
text-conditional image diffusion model, termed RAPHAEL, to generate highly artistic images …
Photomaker: Customizing realistic human photos via stacked id embedding
Recent advances in text-to-image generation have made remarkable progress in
synthesizing realistic human photos conditioned on given text prompts. However existing …
synthesizing realistic human photos conditioned on given text prompts. However existing …
Mix-of-show: Decentralized low-rank adaptation for multi-concept customization of diffusion models
Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained
significant attention from the community. These models can be easily customized for new …
significant attention from the community. These models can be easily customized for new …
Momentdiff: Generative video moment retrieval from random to real
Video moment retrieval pursues an efficient and generalized solution to identify the specific
temporal segments within an untrimmed video that correspond to a given language …
temporal segments within an untrimmed video that correspond to a given language …
Dreamvideo: Composing your dream videos with customized subject and motion
Customized generation using diffusion models has made impressive progress in image
generation but remains unsatisfactory in the challenging video generation task as it requires …
generation but remains unsatisfactory in the challenging video generation task as it requires …
Styledrop: Text-to-image synthesis of any style
Pre-trained large text-to-image models synthesize impressive images with an appropriate
use of text prompts. However, ambiguities inherent in natural language, and out-of …
use of text prompts. However, ambiguities inherent in natural language, and out-of …