Adversarial text-to-image synthesis: A review
With the advent of generative adversarial networks, synthesizing images from text
descriptions has recently become an active research area. It is a flexible and intuitive way for …
Image generation: A review
The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …
GLIGEN: Open-set grounded text-to-image generation
Large-scale text-to-image diffusion models have made amazing advances. However, the
status quo is to use text input alone, which can impede controllability. In this work, we …
SpaText: Spatio-textual representation for controllable image generation
Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …
High-resolution image synthesis with latent diffusion models
R Rombach, A Blattmann, D Lorenz… - Proceedings of the …, 2022 - openaccess.thecvf.com
By decomposing the image formation process into a sequential application of denoising
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …
BoxDiff: Text-to-image synthesis with training-free box-constrained diffusion
Recent text-to-image diffusion models have demonstrated an astonishing capacity to
generate high-quality images. However, researchers mainly studied the way of synthesizing …
ReCo: Region-controlled text-to-image generation
Recently, large-scale text-to-image (T2I) models have shown impressive performance in
generating high-fidelity images, but with limited controllability, e.g., precisely specifying the …
Disentangled representation learning
Disentangled Representation Learning (DRL) aims to learn a model capable of identifying
and disentangling the underlying factors hidden in the observable data in representation …
Instance-conditioned GAN
Generative Adversarial Networks (GANs) can generate near photo realistic images
in narrow domains such as human faces. Yet, modeling complex distributions of datasets …
Frido: Feature pyramid diffusion for complex scene image synthesis
Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …