Video generation from single semantic label map

M Elasri, O Elharrouss, S Al-Maadeed, H Tairi - Neural Processing Letters, 2022 - Springer

The creation of an image from another and from different types of data including text, scene
graph, and object layout, is one of the very challenging tasks in computer vision. In addition …

Cited by 92 - Related articles - All 6 versions

[PDF] acm.org

Video generative adversarial networks: a review

N Aldausari, A Sowmya, N Marcus… - ACM Computing Surveys …, 2022 - dl.acm.org

With the increasing interest in the content creation field in multiple sectors such as media,
education, and entertainment, there is an increased trend in the papers that use AI …

Cited by 138 - Related articles - All 7 versions

[PDF] thecvf.com

Conditional image-to-video generation with latent flow diffusion models

H Ni, C Shi, K Li, SX Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Conditional image-to-video (cI2V) generation aims to synthesize a new plausible video
starting from an image (e.g., a person's face) and a condition (e.g., an action class label like …

Cited by 142 - Related articles - All 6 versions

[PDF] thecvf.com

A dynamic multi-scale voxel flow network for video prediction

X Hu, Z Huang, A Huang, J Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …

Cited by 75 - Related articles - All 8 versions

[PDF] arxiv.org

Few-shot video-to-video synthesis

TC Wang, MY Liu, A Tao, G Liu, J Kautz… - arXiv preprint arXiv …, 2019 - arxiv.org

Video-to-video synthesis (vid2vid) aims at converting an input semantic video, such as
videos of human poses or segmentation masks, to an output photorealistic video. While the …

Cited by 418 - Related articles - All 10 versions

[PDF] arxiv.org

Latent image animator: Learning to animate images via latent space navigation

Y Wang, D Yang, F Bremond, A Dantcheva - arXiv preprint arXiv …, 2022 - arxiv.org

Due to the remarkable progress of deep generative models, animating images has become
increasingly efficient, whereas associated results have become increasingly realistic …

Cited by 153 - Related articles - All 7 versions

[PDF] thecvf.com

Generating representative samples for few-shot classification

J Xu, H Le - Proceedings of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

Few-shot learning (FSL) aims to learn new categories with a few visual samples per class.
Few-shot class representations are often biased due to data scarcity. To mitigate this issue …

Cited by 80 - Related articles - All 6 versions

[PDF] arxiv.org

World-consistent video-to-video synthesis

A Mallya, TC Wang, K Sapra, MY Liu - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer

Video-to-video synthesis (vid2vid) aims for converting high-level semantic inputs to
photorealistic videos. While existing vid2vid methods can achieve short-term temporal …

Cited by 111 - Related articles - All 4 versions

[PDF] neurips.cc

CCVS: Context-aware controllable video synthesis

G Le Moing, J Ponce, C Schmid - Advances in Neural …, 2021 - proceedings.neurips.cc

This presentation introduces a self-supervised learning approach to the synthesis of new
video clips from old ones, with several new key elements for improved spatial resolution …

Cited by 76 - Related articles - All 8 versions

[PDF] arxiv.org

Latent video transformer

R Rakhimov, D Volkhonskiy, A Artemov, D Zorin… - arXiv preprint arXiv …, 2020 - arxiv.org

The video generation task can be formulated as a prediction of future video frames given
some past frames. Recent generative models for videos face the problem of high …

Cited by 127 - Related articles - All 6 versions