Pros and cons of GAN evaluation measures: New developments

A Borji - Computer Vision and Image Understanding, 2022 - Elsevier
This work is an update of my previous paper on the same topic published a few years ago
(Borji, 2019). With the dramatic progress in generative modeling, a suite of new quantitative …

Generative adversarial networks for face generation: A survey

A Kammoun, R Slama, H Tabia, T Ouni… - ACM Computing …, 2022 - dl.acm.org
Recently, generative adversarial networks (GANs) have progressed enormously, which
makes them able to learn complex data distributions in particular faces. More and more …

Imagereward: Learning and evaluating human preferences for text-to-image generation

J Xu, X Liu, Y Wu, Y Tong, Q Li… - Advances in …, 2024 - proceedings.neurips.cc
We present a comprehensive solution to learn and improve text-to-image models from
human preference feedback. To begin with, we build ImageReward---the first general …

Make-a-scene: Scene-based text-to-image generation with human priors

O Gafni, A Polyak, O Ashual, S Sheynin… - … on Computer Vision, 2022 - Springer
Recent text-to-image generation methods provide a simple yet exciting conversion capability
between text and image domains. While these methods have incrementally improved the …

Cold diffusion: Inverting arbitrary image transforms without noise

A Bansal, E Borgnia, HM Chu, J Li… - Advances in …, 2024 - proceedings.neurips.cc
Standard diffusion models involve an image transform--adding Gaussian noise--and an
image restoration operator that inverts this degradation. We observe that the generative …

Diffusion models beat gans on image synthesis

P Dhariwal, A Nichol - Advances in neural information …, 2021 - proceedings.neurips.cc
We show that diffusion models can achieve image sample quality superior to the current
state-of-the-art generative models. We achieve this on unconditional image synthesis by …

Cross-modal contrastive learning for text-to-image generation

H Zhang, JY Koh, J Baldridge… - Proceedings of the …, 2021 - openaccess.thecvf.com
The output of text-to-image synthesis systems should be coherent, clear, photo-realistic
scenes with high semantic fidelity to their conditioned text descriptions. Our Cross-Modal …

Brain-inspired replay for continual learning with artificial neural networks

GM Van de Ven, HT Siegelmann, AS Tolias - Nature communications, 2020 - nature.com
Artificial neural networks suffer from catastrophic forgetting. Unlike humans, when these
networks are trained on something new, they rapidly forget what was learned before. In the …

Improved techniques for training score-based generative models

Y Song, S Ermon - Advances in neural information …, 2020 - proceedings.neurips.cc
Score-based generative models can produce high quality image samples comparable to
GANs, without requiring adversarial optimization. However, existing training procedures are …

Generating diverse high-fidelity images with vq-vae-2

A Razavi, A Van den Oord… - Advances in neural …, 2019 - proceedings.neurips.cc
We explore the use of Vector Quantized Variational AutoEncoder (VQ-VAE) models for large
scale image generation. To this end, we scale and enhance the autoregressive priors used …