Geometric GAN - Academic resource search

Deep generative modelling: A comparative review of VAEs, GANs, normalizing flows, energy-based and autoregressive models

S Bond-Taylor, A Leach, Y Long… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Deep generative models are a class of techniques that train deep neural networks to model
the distribution of training samples. Research has fragmented into various interconnected …

Cited by 443 - Related articles - All 12 versions

[PDF] arxiv.org

A review on generative adversarial networks: Algorithms, theory, and applications

J Gui, Z Sun, Y Wen, D Tao, J Ye - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …

Cited by 943 - Related articles - All 13 versions

[PDF] mlr.press

StyleGAN-T: Unlocking the power of GANs for fast large-scale text-to-image synthesis

A Sauer, T Karras, S Laine… - … on machine learning, 2023 - proceedings.mlr.press

Text-to-image synthesis has recently seen significant progress thanks to large pretrained
language models, large-scale training data, and the introduction of scalable model families …

Cited by 142 - Related articles - All 8 versions

[PDF] neurips.cc

High-fidelity audio compression with improved RVQGAN

R Kumar, P Seetharaman, A Luebs… - Advances in Neural …, 2024 - proceedings.neurips.cc

Language models have been successfully used to model natural signals, such as
images, speech, and music. A key component of these models is a high quality neural …

Cited by 80 - Related articles - All 5 versions

[PDF] arxiv.org

Adversarial diffusion distillation

A Sauer, D Lorenz, A Blattmann… - arXiv preprint arXiv …, 2023 - arxiv.org

We introduce Adversarial Diffusion Distillation (ADD), a novel training approach that
efficiently samples large-scale foundational image diffusion models in just 1-4 steps while …

Cited by 81 - Related articles - All 2 versions

[PDF] thecvf.com

StyleSwin: Transformer-based GAN for high-resolution image generation

B Zhang, S Gu, B Zhang, J Bao… - Proceedings of the …, 2022 - openaccess.thecvf.com

Despite the tantalizing success in a broad of vision tasks, transformers have not yet
demonstrated on-par ability as ConvNets in high-resolution image generative modeling. In …

Cited by 209 - Related articles - All 7 versions

[PDF] arxiv.org

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arXiv preprint arXiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Cited by 354 - Related articles - All 2 versions

[PDF] arxiv.org

SinNeRF: Training neural radiance fields on complex scenes from a single image

D Xu, Y Jiang, P Wang, Z Fan, H Shi… - European Conference on …, 2022 - Springer

Despite the rapid development of Neural Radiance Field (NeRF), the necessity of dense
covers largely prohibits its wider applications. While several recent works have attempted to …

Cited by 134 - Related articles - All 5 versions

[PDF] thecvf.com

Cross-modal contrastive learning for text-to-image generation

H Zhang, JY Koh, J Baldridge… - Proceedings of the …, 2021 - openaccess.thecvf.com

The output of text-to-image synthesis systems should be coherent, clear, photo-realistic
scenes with high semantic fidelity to their conditioned text descriptions. Our Cross-Modal …

Cited by 337 - Related articles - All 6 versions

[PDF] thecvf.com

PD-GAN: Probabilistic diverse GAN for image inpainting

H Liu, Z Wan, W Huang, Y Song… - Proceedings of the …, 2021 - openaccess.thecvf.com

We propose PD-GAN, a probabilistic diverse GAN for image inpainting. Given an input image
with arbitrary hole regions, PD-GAN produces multiple inpainting results with diverse and …

Cited by 216 - Related articles - All 7 versions