A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …
massive model sizes that require significant computational and storage resources. To …
Diffusion Models for Image Restoration and Enhancement--A Comprehensive Survey
Image restoration (IR) has been an indispensable and challenging task in the low-level
vision field, which strives to improve the subjective quality of images distorted by various …
vision field, which strives to improve the subjective quality of images distorted by various …
Exploiting diffusion prior for real-world image super-resolution
We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …
to-image diffusion models for blind super-resolution. Specifically, by employing our time …
Deepcache: Accelerating diffusion models for free
Diffusion models have recently gained unprecedented attention in the field of image
synthesis due to their remarkable generative capabilities. Notwithstanding their prowess …
synthesis due to their remarkable generative capabilities. Notwithstanding their prowess …
Diffusion model as representation learner
Abstract Diffusion Probabilistic Models (DPMs) have recently demonstrated impressive
results on various generative tasks. Despite its promises, the learned representations of pre …
results on various generative tasks. Despite its promises, the learned representations of pre …
In-context learning unlocked for diffusion models
Abstract We present Prompt Diffusion, a framework for enabling in-context learning in
diffusion-based generative models. Given a pair of task-specific example images, such as …
diffusion-based generative models. Given a pair of task-specific example images, such as …
Sg-former: Self-guided transformer with evolving token reallocation
Vision Transformer has demonstrated impressive success across various vision tasks.
However, its heavy computation cost, which grows quadratically with respect to the token …
However, its heavy computation cost, which grows quadratically with respect to the token …
Priority-centric human motion generation in discrete latent space
Text-to-motion generation is a formidable task, aiming to produce human motions that align
with the input text while also adhering to human capabilities and physical laws. While there …
with the input text while also adhering to human capabilities and physical laws. While there …
On the design fundamentals of diffusion models: A survey
Diffusion models are generative models, which gradually add and remove noise to learn the
underlying distribution of training data for data generation. The components of diffusion …
underlying distribution of training data for data generation. The components of diffusion …
Taming mode collapse in score distillation for text-to-3d generation
Despite the remarkable performance of score distillation in text-to-3D generation such
techniques notoriously suffer from view inconsistency issues also known as" Janus" artifact …
techniques notoriously suffer from view inconsistency issues also known as" Janus" artifact …