Image generation: A review

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer
The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

Video generative adversarial networks: a review

N Aldausari, A Sowmya, N Marcus… - ACM Computing Surveys …, 2022 - dl.acm.org
With the increasing interest in the content creation field in multiple sectors such as media,
education, and entertainment, there is an increased trend in the papers that use AI …

Conditional image-to-video generation with latent flow diffusion models

H Ni, C Shi, K Li, SX Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (eg, a person's face) and a condition (eg, an action class label like …

A dynamic multi-scale voxel flow network for video prediction

X Hu, Z Huang, A Huang, J Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …

Few-shot video-to-video synthesis

TC Wang, MY Liu, A Tao, G Liu, J Kautz… - arXiv preprint arXiv …, 2019 - arxiv.org
Video-to-video synthesis (vid2vid) aims at converting an input semantic video, such as
videos of human poses or segmentation masks, to an output photorealistic video. While the …

Latent image animator: Learning to animate images via latent space navigation

Y Wang, D Yang, F Bremond, A Dantcheva - arXiv preprint arXiv …, 2022 - arxiv.org
Due to the remarkable progress of deep generative models, animating images has become
increasingly efficient, whereas associated results have become increasingly realistic …

Generating representative samples for few-shot classification

J Xu, H Le - Proceedings of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Few-shot learning (FSL) aims to learn new categories with a few visual samples per class.
Few-shot class representations are often biased due to data scarcity. To mitigate this issue …

World-consistent video-to-video synthesis

A Mallya, TC Wang, K Sapra, MY Liu - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Video-to-video synthesis (vid2vid) aims for converting high-level semantic inputs to
photorealistic videos. While existing vid2vid methods can achieve short-term temporal …

Ccvs: Context-aware controllable video synthesis

G Le Moing, J Ponce, C Schmid - Advances in Neural …, 2021 - proceedings.neurips.cc
This presentation introduces a self-supervised learning approach to the synthesis of new
videos clips from old ones, with several new key elements for improved spatial resolution …

Latent video transformer

R Rakhimov, D Volkhonskiy, A Artemov, D Zorin… - arXiv preprint arXiv …, 2020 - arxiv.org
The video generation task can be formulated as a prediction of future video frames given
some past frames. Recent generative models for videos face the problem of high …