Diffusion models: A comprehensive survey of methods and applications
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …
record-breaking performance in many applications, including image synthesis, video …
Development of metaverse for intelligent healthcare
The metaverse integrates physical and virtual realities, enabling humans and their avatars to
interact in an environment supported by technologies such as high-speed internet, virtual …
interact in an environment supported by technologies such as high-speed internet, virtual …
Adding conditional control to text-to-image diffusion models
L Zhang, A Rao, M Agrawala - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We present ControlNet, a neural network architecture to add spatial conditioning controls to
large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large …
large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large …
Instructpix2pix: Learning to follow image editing instructions
We propose a method for editing images from human instructions: given an input image and
a written instruction that tells the model what to do, our model follows these instructions to …
a written instruction that tells the model what to do, our model follows these instructions to …
Magic3d: High-resolution text-to-3d content creation
Recently, DreamFusion demonstrated the utility of a pretrained text-to-image diffusion model
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …
to optimize Neural Radiance Fields (NeRF), achieving remarkable text-to-3D synthesis …
Align your latents: High-resolution video synthesis with latent diffusion models
Abstract Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding
excessive compute demands by training a diffusion model in a compressed lower …
excessive compute demands by training a diffusion model in a compressed lower …
Multi-concept customization of text-to-image diffusion
While generative models produce high-quality images of concepts learned from a large-
scale database, a user often wishes to synthesize instantiations of their own concepts (for …
scale database, a user often wishes to synthesize instantiations of their own concepts (for …
T2i-adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models
The incredible generative ability of large-scale text-to-image (T2I) models has demonstrated
strong power of learning complex structures and meaningful semantics. However, relying …
strong power of learning complex structures and meaningful semantics. However, relying …
Imagic: Text-based real image editing with diffusion models
Text-conditioned image editing has recently attracted considerable interest. However, most
methods are currently limited to one of the following: specific editing types (eg, object …
methods are currently limited to one of the following: specific editing types (eg, object …
Tune-a-video: One-shot tuning of image diffusion models for text-to-video generation
To replicate the success of text-to-image (T2I) generation, recent works employ large-scale
video datasets to train a text-to-video (T2V) generator. Despite their promising results, such …
video datasets to train a text-to-video (T2V) generator. Despite their promising results, such …