Palette: Image-to-image diffusion models

C Saharia, W Chan, H Chang, C Lee, J Ho… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …

Large scale image completion via co-modulated generative adversarial networks

S Zhao, J Cui, Y Sheng, Y Dong, X Liang… - arXiv preprint arXiv …, 2021 - arxiv.org
Numerous task-specific variants of conditional generative adversarial networks have been
developed for image completion. Yet, a serious limitation remains that all existing algorithms …

A task is worth one word: Learning with task prompts for high-quality versatile image inpainting

J Zhuang, Y Zeng, W Liu, C Yuan, K Chen - European Conference on …, 2025 - Springer
Advancing image inpainting is challenging as it requires filling user-specified regions for
various intents, such as background filling and object synthesis. Existing approaches focus …

PanoGen: Text-conditioned panoramic environment generation for vision-and-language navigation

J Li, M Bansal - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc
Vision-and-Language Navigation requires the agent to follow language instructions
to navigate through 3D environments. One main challenge in Vision-and-Language …

InfiniteNature-Zero: Learning perpetual view generation of natural scenes from single images

Z Li, Q Wang, N Snavely, A Kanazawa - European Conference on …, 2022 - Springer
We present a method for learning to generate unbounded flythrough videos of natural
scenes starting from a single view. This capability is learned from a collection of single …

Infinite nature: Perpetual view generation of natural scenes from a single image

A Liu, R Tucker, V Jampani… - Proceedings of the …, 2021 - openaccess.thecvf.com
We introduce the problem of perpetual view generation: long-range generation of novel
views corresponding to an arbitrarily long camera trajectory given a single image. This is a …

PixelSynth: Generating a 3D-consistent experience from a single image

C Rockwell, DF Fouhey… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Recent advancements in differentiable rendering and 3D reasoning have driven exciting
results in novel view synthesis from a single image. Despite realistic results, methods are …

Persistent nature: A generative model of unbounded 3D worlds

L Chai, R Tucker, Z Li, P Isola… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite increasingly realistic image quality, recent 3D image generative models often
operate on 3D volumes of fixed extent with limited camera motions. We investigate the task …

Training-free diffusion model adaptation for variable-sized text-to-image synthesis

Z Jin, X Shen, B Li, X Xue - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Diffusion models (DMs) have recently gained attention with state-of-the-art performance in
text-to-image synthesis. Abiding by the tradition in deep learning, DMs are trained and …

Any-resolution training for high-resolution image synthesis

L Chai, M Gharbi, E Shechtman, P Isola… - European Conference on …, 2022 - Springer
Generative models operate at fixed resolution, even though natural images come in a variety
of sizes. As high-resolution details are downsampled away and low-resolution images are …