A review on generative adversarial networks: Algorithms, theory, and applications
Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …
however, they have been studied since 2014, and a large number of algorithms have been …
An overview of voice conversion and its challenges: From statistical modeling to deep learning
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …
conversion, we change the speaker identity from one to another, while keeping the linguistic …
Instructpix2pix: Learning to follow image editing instructions
We propose a method for editing images from human instructions: given an input image and
a written instruction that tells the model what to do, our model follows these instructions to …
a written instruction that tells the model what to do, our model follows these instructions to …
Spatext: Spatio-textual representation for controllable image generation
Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …
unprecedented quality. However, it is nearly impossible to control the shapes of different …
Palette: Image-to-image diffusion models
This paper develops a unified framework for image-to-image translation based on
conditional diffusion models and evaluates this framework on four challenging image-to …
conditional diffusion models and evaluates this framework on four challenging image-to …
Ilvr: Conditioning method for denoising diffusion probabilistic models
Denoising diffusion probabilistic models (DDPM) have shown remarkable performance in
unconditional image generation. However, due to the stochasticity of the generative process …
unconditional image generation. However, due to the stochasticity of the generative process …
Gan inversion: A survey
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …
model so that the image can be faithfully reconstructed from the inverted code by the …
Contrastive learning for unpaired image-to-image translation
In image-to-image translation, each patch in the output should reflect the content of the
corresponding patch in the input, independent of domain. We propose a straightforward …
corresponding patch in the input, independent of domain. We propose a straightforward …
One-shot free-view neural talking-head synthesis for video conferencing
We propose a neural talking-head video synthesis model and demonstrate its application to
video conferencing. Our model learns to synthesize a talking-head video using a source …
video conferencing. Our model learns to synthesize a talking-head video using a source …
On aliased resizing and surprising subtleties in gan evaluation
Metrics for evaluating generative models aim to measure the discrepancy between real and
generated images. The oftenused Frechet Inception Distance (FID) metric, for example …
generated images. The oftenused Frechet Inception Distance (FID) metric, for example …