Consistent diffusion models: Mitigating sampling drift by learning to be consistent

G Daras, Y Dagan, A Dimakis… - Advances in Neural …, 2024 - proceedings.neurips.cc
Imperfect score-matching leads to a shift between the training and the sampling distribution
of diffusion models. Due to the recursive nature of the generation process, errors in previous …

Structure-Guided Adversarial Training of Diffusion Models

L Yang, H Qian, Z Zhang, J Liu… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Diffusion models have demonstrated exceptional efficacy in various generative applications.
While existing models focus on minimizing a weighted sum of denoising score matching …

Unsupervised vocal dereverberation with diffusion-based generative models

K Saito, N Murata, T Uesaka, CH Lai… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Removing reverb from reverberant music is a necessary technique to clean up audio for
downstream music manipulations. Reverberation of music contains two categories, natural …

Hierarchical diffusion models for singing voice neural vocoder

N Takahashi, M Kumar, Y Mitsufuji - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Recent progress in deep generative models has improved the quality of neural vocoders in
speech domain. However, generating a high-quality singing voice remains challenging due …

Diffroll: Diffusion-based generative music transcription with unsupervised pretraining capability

KW Cheuk, R Sawata, T Uesaka… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper we propose a novel generative approach, DiffRoll, to tackle automatic music
transcription (AMT). Instead of treating AMT as a discriminative task in which the model is …

Score-based physics-informed neural networks for high-dimensional Fokker-Planck equations

Z Hu, Z Zhang, GE Karniadakis… - arXiv preprint arXiv …, 2024 - arxiv.org
The Fokker-Planck (FP) equation is a foundational PDE in stochastic processes. However,
curse of dimensionality (CoD) poses challenge when dealing with high-dimensional FP …

On Error Propagation of Diffusion Models

Y Li, M van der Schaar - The Twelfth International Conference on …, 2023 - openreview.net
Although diffusion models (DMs) have shown promising performances in a number of tasks
(eg, speech synthesis and image generation), they might suffer from error propagation …

Particle Denoising Diffusion Sampler

A Phillips, HD Dau, MJ Hutchinson, V De Bortoli… - arXiv preprint arXiv …, 2024 - arxiv.org
Denoising diffusion models have become ubiquitous for generative modeling. The core idea
is to transport the data distribution to a Gaussian by using a diffusion. Approximate samples …

Score-fPINN: Fractional Score-Based Physics-Informed Neural Networks for High-Dimensional Fokker-Planck-Levy Equations

Z Hu, Z Zhang, GE Karniadakis… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce an innovative approach for solving high-dimensional Fokker-Planck-L\'evy
(FPL) equations in modeling non-Brownian processes across disciplines such as physics …

Do diffusion models suffer error propagation? theoretical analysis and consistency regularization

Y Li, Z Qian, M van der Schaar - arXiv preprint arXiv:2308.05021, 2023 - arxiv.org
While diffusion models have achieved promising performances in data synthesis, they might
suffer error propagation because of their cascade structure, where the distributional …