Encoder-based domain tuning for fast personalization of text-to-image models

R Gal, M Arar, Y Atzmon, AH Bermano… - ACM Transactions on …, 2023 - dl.acm.org
Text-to-image personalization aims to teach a pre-trained diffusion model to reason about
novel, user provided concepts, embedding them into new scenes guided by natural …

GAN inversion: A survey

W Xia, Y Zhang, Y Yang, JH Xue… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
GAN inversion aims to invert a given image back into the latent space of a pretrained GAN
model so that the image can be faithfully reconstructed from the inverted code by the …

StyleHEAT: One-shot high-resolution editable talking face generation via pre-trained StyleGAN

F Yin, Y Zhang, X Cun, M Cao, Y Fan, X Wang… - European conference on …, 2022 - Springer
One-shot talking face generation aims at synthesizing a high-quality talking face video from
an arbitrary portrait image, driven by a video or an audio segment. In this work, we provide a …

High-fidelity 3D GAN inversion by pseudo-multi-view optimization

J Xie, H Ouyang, J Piao, C Lei… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a high-fidelity 3D generative adversarial network (GAN) inversion framework
that can synthesize photo-realistic novel views while preserving specific details of the input …

Domain-agnostic tuning-encoder for fast personalization of text-to-image models

M Arar, R Gal, Y Atzmon, G Chechik… - SIGGRAPH Asia 2023 …, 2023 - dl.acm.org
Text-to-image (T2I) personalization allows users to guide the creative image generation
process by combining their own visual concepts in natural language prompts. Recently …

HyperReenact: One-shot reenactment via jointly learning to refine and retarget faces

S Bounareli, C Tzelepis, V Argyriou… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we present our method for neural face reenactment, called HyperReenact, that
aims to generate realistic talking head images of a source identity, driven by a target facial …

3D GAN inversion with facial symmetry prior

F Yin, Y Zhang, X Wang, T Wang, X Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, a surge of high-quality 3D-aware GANs have been proposed, which leverage the
generative power of neural rendering. It is natural to associate 3D GANs with GAN inversion …

Image synthesis under limited data: A survey and taxonomy

M Yang, Z Wang - arXiv preprint arXiv:2307.16879, 2023 - arxiv.org
Deep generative models, which target reproducing the given data distribution to produce
novel samples, have made unprecedented advancements in recent years. Their technical …

Position, padding and predictions: A deeper look at position information in CNNs

MA Islam, M Kowal, S Jia, KG Derpanis… - International Journal of …, 2024 - Springer
In contrast to fully connected networks, Convolutional Neural Networks (CNNs) achieve
efficiency by learning weights associated with local filters with a finite spatial extent …

Learning 3D-aware image synthesis with unknown pose distribution

Z Shi, Y Shen, Y Xu, S Peng, Y Liao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing methods for 3D-aware image synthesis largely depend on the 3D pose distribution
pre-estimated on the training set. An inaccurate estimation may mislead the model into …