Object-centric image generation from layouts

S Frolov, T Hinz, F Raue, J Hees, A Dengel - Neural Networks, 2021 - Elsevier

With the advent of generative adversarial networks, synthesizing images from text
descriptions has recently become an active research area. It is a flexible and intuitive way for …

被引用次数：188 相关文章所有 9 个版本

[PDF] researchgate.net

Image generation: A review

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer

The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

被引用次数：71 相关文章所有 6 个版本

[PDF] thecvf.com

Gligen: Open-set grounded text-to-image generation

Y Li, H Liu, Q Wu, F Mu, J Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large-scale text-to-image diffusion models have made amazing advances. However, the
status quo is to use text input alone, which can impede controllability. In this work, we …

被引用次数：177 相关文章所有 5 个版本

[PDF] thecvf.com

Spatext: Spatio-textual representation for controllable image generation

O Avrahami, T Hayes, O Gafni… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …

被引用次数：144 相关文章所有 5 个版本

[PDF] thecvf.com

High-resolution image synthesis with latent diffusion models

R Rombach, A Blattmann, D Lorenz… - Proceedings of the …, 2022 - openaccess.thecvf.com

By decomposing the image formation process into a sequential application of denoising
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …

被引用次数：10470 相关文章所有 11 个版本

[PDF] thecvf.com

Boxdiff: Text-to-image synthesis with training-free box-constrained diffusion

J Xie, Y Li, Y Huang, H Liu, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent text-to-image diffusion models have demonstrated an astonishing capacity to
generate high-quality images. However, researchers mainly studied the way of synthesizing …

被引用次数：93 相关文章所有 8 个版本

[PDF] thecvf.com

Reco: Region-controlled text-to-image generation

Z Yang, J Wang, Z Gan, L Li, K Lin… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, large-scale text-to-image (T2I) models have shown impressive performance in
generating high-fidelity images, but with limited controllability, eg, precisely specifying the …

被引用次数：94 相关文章所有 5 个版本

[PDF] arxiv.org

Disentangled representation learning

X Wang, H Chen, S Tang, Z Wu, W Zhu - arXiv preprint arXiv:2211.11695, 2022 - arxiv.org

Disentangled Representation Learning (DRL) aims to learn a model capable of identifying
and disentangling the underlying factors hidden in the observable data in representation …

被引用次数：140 相关文章所有 4 个版本

[PDF] neurips.cc

Instance-conditioned gan

A Casanova, M Careil, J Verbeek… - Advances in …, 2021 - proceedings.neurips.cc

Abstract Generative Adversarial Networks (GANs) can generate near photo realistic images
in narrow domains such as human faces. Yet, modeling complex distributions of datasets …

被引用次数：124 相关文章所有 6 个版本

[PDF] aaai.org

Frido: Feature pyramid diffusion for complex scene image synthesis

WC Fan, YC Chen, DD Chen, Y Cheng… - Proceedings of the …, 2023 - ojs.aaai.org

Diffusion models (DMs) have shown great potential for high-quality image synthesis.
However, when it comes to producing images with complex scenes, how to properly …

被引用次数：62 相关文章所有 4 个版本