A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Diffusion Models for Image Restoration and Enhancement--A Comprehensive Survey

X Li, Y Ren, X Jin, C Lan, X Wang, W Zeng… - arXiv preprint arXiv …, 2023 - arxiv.org
Image restoration (IR) has been an indispensable and challenging task in the low-level
vision field, which strives to improve the subjective quality of images distorted by various …

Exploiting diffusion prior for real-world image super-resolution

J Wang, Z Yue, S Zhou, KCK Chan, CC Loy - International Journal of …, 2024 - Springer
We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …

Deepcache: Accelerating diffusion models for free

X Ma, G Fang, X Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Diffusion models have recently gained unprecedented attention in the field of image
synthesis due to their remarkable generative capabilities. Notwithstanding their prowess …

Diffusion model as representation learner

X Yang, X Wang - … of the IEEE/CVF International Conference …, 2023 - openaccess.thecvf.com
Abstract Diffusion Probabilistic Models (DPMs) have recently demonstrated impressive
results on various generative tasks. Despite its promises, the learned representations of pre …

In-context learning unlocked for diffusion models

Z Wang, Y Jiang, Y Lu, P He, W Chen… - Advances in …, 2023 - proceedings.neurips.cc
Abstract We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …

Sg-former: Self-guided transformer with evolving token reallocation

S Ren, X Yang, S Liu, X Wang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Vision Transformer has demonstrated impressive success across various vision tasks.
However, its heavy computation cost, which grows quadratically with respect to the token …

Priority-centric human motion generation in discrete latent space

H Kong, K Gong, D Lian, MB Mi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-to-motion generation is a formidable task, aiming to produce human motions that align
with the input text while also adhering to human capabilities and physical laws. While there …

On the design fundamentals of diffusion models: A survey

Z Chang, GA Koulieris, HPH Shum - arXiv preprint arXiv:2306.04542, 2023 - arxiv.org
Diffusion models are generative models, which gradually add and remove noise to learn the
underlying distribution of training data for data generation. The components of diffusion …

Taming mode collapse in score distillation for text-to-3d generation

P Wang, D Xu, Z Fan, D Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Despite the remarkable performance of score distillation in text-to-3D generation such
techniques notoriously suffer from view inconsistency issues also known as" Janus" artifact …