Palette: Image-to-image diffusion models

C Saharia, W Chan, H Chang, C Lee, J Ho… - ACM SIGGRAPH 2022 …, 2022 - dl.acm.org

This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …

Cited by 1402; 10 versions

Large scale image completion via co-modulated generative adversarial networks

S Zhao, J Cui, Y Sheng, Y Dong, X Liang… - arXiv preprint arXiv …, 2021 - arxiv.org

Numerous task-specific variants of conditional generative adversarial networks have been
developed for image completion. Yet, a serious limitation remains that all existing algorithms …

Cited by 312; 9 versions

A task is worth one word: Learning with task prompts for high-quality versatile image inpainting

J Zhuang, Y Zeng, W Liu, C Yuan, K Chen - European Conference on …, 2025 - Springer

Advancing image inpainting is challenging as it requires filling user-specified regions for
various intents, such as background filling and object synthesis. Existing approaches focus …

Cited by 36; 2 versions

PanoGen: Text-conditioned panoramic environment generation for vision-and-language navigation

J Li, M Bansal - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc

Vision-and-Language Navigation requires the agent to follow language instructions
to navigate through 3D environments. One main challenge in Vision-and-Language …

Cited by 40; 5 versions

InfiniteNature-Zero: Learning perpetual view generation of natural scenes from single images

Z Li, Q Wang, N Snavely, A Kanazawa - European Conference on …, 2022 - Springer

We present a method for learning to generate unbounded flythrough videos of natural
scenes starting from a single view. This capability is learned from a collection of single …

Cited by 53; 8 versions

Infinite nature: Perpetual view generation of natural scenes from a single image

A Liu, R Tucker, V Jampani… - Proceedings of the …, 2021 - openaccess.thecvf.com

We introduce the problem of perpetual view generation: long-range generation of novel
views corresponding to an arbitrarily long camera trajectory given a single image. This is a …

Cited by 147; 5 versions

PixelSynth: Generating a 3D-consistent experience from a single image

C Rockwell, DF Fouhey… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Recent advancements in differentiable rendering and 3D reasoning have driven exciting
results in novel view synthesis from a single image. Despite realistic results, methods are …

Cited by 84; 7 versions

Persistent nature: A generative model of unbounded 3D worlds

L Chai, R Tucker, Z Li, P Isola… - Proceedings of the …, 2023 - openaccess.thecvf.com

Despite increasingly realistic image quality, recent 3D image generative models often
operate on 3D volumes of fixed extent with limited camera motions. We investigate the task …

Cited by 23; 9 versions

Training-free diffusion model adaptation for variable-sized text-to-image synthesis

Z Jin, X Shen, B Li, X Xue - Advances in Neural Information …, 2023 - proceedings.neurips.cc

Diffusion models (DMs) have recently gained attention with state-of-the-art performance in
text-to-image synthesis. Abiding by the tradition in deep learning, DMs are trained and …

Cited by 22; 6 versions

Any-resolution training for high-resolution image synthesis

L Chai, M Gharbi, E Shechtman, P Isola… - European Conference on …, 2022 - Springer

Generative models operate at fixed resolution, even though natural images come in a variety
of sizes. As high-resolution details are downsampled away and low-resolution images are …

Cited by 69; 6 versions