Videocrafter1: Open diffusion models for high-quality video generation

L Hu - Proceedings of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Character Animation aims to generating character videos from still images through driving
signals. Currently diffusion models have become the mainstream in visual generation …

被引用次数：125 相关文章所有 3 个版本

[PDF] thecvf.com

Vbench: Comprehensive benchmark suite for video generative models

Z Huang, Y He, J Yu, F Zhang, C Si… - Proceedings of the …, 2024 - openaccess.thecvf.com

Video generation has witnessed significant advancements yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …

被引用次数：68 相关文章所有 4 个版本

[PDF] thecvf.com

Videocrafter2: Overcoming data limitations for high-quality video diffusion models

H Chen, Y Zhang, X Cun, M Xia… - Proceedings of the …, 2024 - openaccess.thecvf.com

Text-to-video generation aims to produce a video based on a given prompt. Recently
several commercial video models have been able to generate plausible videos with minimal …

被引用次数：62 相关文章所有 3 个版本

[PDF] thecvf.com

Evalcrafter: Benchmarking and evaluating large video generation models

Y Liu, X Cun, X Liu, X Wang, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

The vision and language generative models have been overgrown in recent years. For
video generation various open-sourced models and public-available services have been …

被引用次数：47 相关文章所有 3 个版本

[PDF] arxiv.org

Videopoet: A large language model for zero-shot video generation

D Kondratyuk, L Yu, X Gu, J Lezama, J Huang… - arXiv preprint arXiv …, 2023 - arxiv.org

We present VideoPoet, a language model capable of synthesizing high-quality video, with
matching audio, from a large variety of conditioning signals. VideoPoet employs a decoder …

被引用次数：74 相关文章所有 5 个版本

[PDF] thecvf.com

Dreamvideo: Composing your dream videos with customized subject and motion

Y Wei, S Zhang, Z Qing, H Yuan, Z Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Customized generation using diffusion models has made impressive progress in image
generation but remains unsatisfactory in the challenging video generation task as it requires …

被引用次数：31 相关文章所有 4 个版本

[PDF] ieee.org

When does Sora show: The beginning of TAO to imaginative intelligence and scenarios engineering

FY Wang, Q Miao, L Li, Q Ni, X Li, J Li… - IEEE/CAA Journal of …, 2024 - ieeexplore.ieee.org

During our discussion at workshops for writing “What Does ChatGPT Say: The DAO from
Algorithmic Intelligence to Linguistic Intelligence”[1], we had expected the next milestone for …

被引用次数：40 相关文章所有 4 个版本

[PDF] arxiv.org

I2vgen-xl: High-quality image-to-video synthesis via cascaded diffusion models

S Zhang, J Wang, Y Zhang, K Zhao, H Yuan… - arXiv preprint arXiv …, 2023 - arxiv.org

Video synthesis has recently made remarkable strides benefiting from the rapid
development of diffusion models. However, it still encounters challenges in terms of …

被引用次数：83 相关文章所有 2 个版本

Motionctrl: A unified and flexible motion controller for video generation

Z Wang, Z Yuan, X Wang, Y Li, T Chen, M Xia… - ACM SIGGRAPH 2024 …, 2024 - dl.acm.org

Motions in a video primarily consist of camera motion, induced by camera movement, and
object motion, resulting from object movement. Accurate control of both camera and object …

被引用次数：49 相关文章所有 2 个版本

[PDF] openreview.net

Scalecrafter: Tuning-free higher-resolution visual generation with diffusion models

Y He, S Yang, H Chen, X Cun, M Xia… - The Twelfth …, 2023 - openreview.net

In this work, we investigate the capability of generating images from pre-trained diffusion
models at much higher resolutions than the training image sizes. In addition, the generated …

被引用次数：24 相关文章所有 3 个版本