Deep generative modelling: A comparative review of vaes, gans, normalizing flows, energy-based and autoregressive models
Deep generative models are a class of techniques that train deep neural networks to model
the distribution of training samples. Research has fragmented into various interconnected …
the distribution of training samples. Research has fragmented into various interconnected …
High-resolution image synthesis with latent diffusion models
R Rombach, A Blattmann, D Lorenz… - Proceedings of the …, 2022 - openaccess.thecvf.com
By decomposing the image formation process into a sequential application of denoising
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …
autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image …
Score-based generative modeling in latent space
Score-based generative models (SGMs) have recently demonstrated impressive results in
terms of both sample quality and distribution coverage. However, they are usually applied …
terms of both sample quality and distribution coverage. However, they are usually applied …
Perception prioritized training of diffusion models
Diffusion models learn to restore noisy data, which is corrupted with different levels of noise,
by optimizing the weighted sum of the corresponding loss terms, ie, denoising score …
by optimizing the weighted sum of the corresponding loss terms, ie, denoising score …
Vector-quantized image modeling with improved vqgan
Pretraining language models with next-token prediction on massive text corpora has
delivered phenomenal zero-shot, few-shot, transfer learning and multi-tasking capabilities …
delivered phenomenal zero-shot, few-shot, transfer learning and multi-tasking capabilities …
Score-based generative modeling with critically-damped langevin diffusion
Score-based generative models (SGMs) have demonstrated remarkable synthesis quality.
SGMs rely on a diffusion process that gradually perturbs the data towards a tractable …
SGMs rely on a diffusion process that gradually perturbs the data towards a tractable …
Wavelet diffusion models are fast and scalable image generators
Diffusion models are rising as a powerful solution for high-fidelity image generation, which
exceeds GANs in quality in many circumstances. However, their slow training and inference …
exceeds GANs in quality in many circumstances. However, their slow training and inference …
Diff-tts: A denoising diffusion model for text-to-speech
Although neural text-to-speech (TTS) models have attracted a lot of attention and succeeded
in generating human-like speech, there is still room for improvements to its naturalness and …
in generating human-like speech, there is still room for improvements to its naturalness and …
Improved transformer for high-resolution gans
Attention-based models, exemplified by the Transformer, can effectively model long range
dependency, but suffer from the quadratic complexity of self-attention operation, making …
dependency, but suffer from the quadratic complexity of self-attention operation, making …
Controllable and compositional generation with latent-space energy-based models
Controllable generation is one of the key requirements for successful adoption of deep
generative models in real-world applications, but it still remains as a great challenge. In …
generative models in real-world applications, but it still remains as a great challenge. In …