Patch diffusion: Faster and more data-efficient training of diffusion models

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org

Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

被引用次数：40 相关文章所有 2 个版本

[PDF] arxiv.org

Diffusion Models for Image Restoration and Enhancement--A Comprehensive Survey

X Li, Y Ren, X Jin, C Lan, X Wang, W Zeng… - arXiv preprint arXiv …, 2023 - arxiv.org

Image restoration (IR) has been an indispensable and challenging task in the low-level
vision field, which strives to improve the subjective quality of images distorted by various …

被引用次数：51 相关文章所有 2 个版本

[PDF] arxiv.org

Exploiting diffusion prior for real-world image super-resolution

J Wang, Z Yue, S Zhou, KCK Chan, CC Loy - International Journal of …, 2024 - Springer

We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …

被引用次数：141 相关文章所有 2 个版本

[PDF] thecvf.com

Deepcache: Accelerating diffusion models for free

X Ma, G Fang, X Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Diffusion models have recently gained unprecedented attention in the field of image
synthesis due to their remarkable generative capabilities. Notwithstanding their prowess …

被引用次数：41 相关文章所有 3 个版本

[PDF] thecvf.com

Diffusion model as representation learner

X Yang, X Wang - … of the IEEE/CVF International Conference …, 2023 - openaccess.thecvf.com

Abstract Diffusion Probabilistic Models (DPMs) have recently demonstrated impressive
results on various generative tasks. Despite its promises, the learned representations of pre …

被引用次数：38 相关文章所有 5 个版本

[PDF] neurips.cc

In-context learning unlocked for diffusion models

Z Wang, Y Jiang, Y Lu, P He, W Chen… - Advances in …, 2023 - proceedings.neurips.cc

Abstract We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …

被引用次数：42 相关文章所有 6 个版本

[PDF] thecvf.com

Sg-former: Self-guided transformer with evolving token reallocation

S Ren, X Yang, S Liu, X Wang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Vision Transformer has demonstrated impressive success across various vision tasks.
However, its heavy computation cost, which grows quadratically with respect to the token …

被引用次数：24 相关文章所有 5 个版本

[PDF] thecvf.com

Priority-centric human motion generation in discrete latent space

H Kong, K Gong, D Lian, MB Mi… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-to-motion generation is a formidable task, aiming to produce human motions that align
with the input text while also adhering to human capabilities and physical laws. While there …

被引用次数：21 相关文章所有 5 个版本

[PDF] arxiv.org

On the design fundamentals of diffusion models: A survey

Z Chang, GA Koulieris, HPH Shum - arXiv preprint arXiv:2306.04542, 2023 - arxiv.org

Diffusion models are generative models, which gradually add and remove noise to learn the
underlying distribution of training data for data generation. The components of diffusion …

被引用次数：42 相关文章所有 2 个版本

[PDF] thecvf.com

Taming mode collapse in score distillation for text-to-3d generation

P Wang, D Xu, Z Fan, D Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite the remarkable performance of score distillation in text-to-3D generation such
techniques notoriously suffer from view inconsistency issues also known as" Janus" artifact …

被引用次数：10 相关文章所有 4 个版本