Sora: A review on background, technology, limitations, and opportunities of large vision models
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …
VBench: Comprehensive benchmark suite for video generative models
Video generation has witnessed significant advancements, yet evaluating these models
remains a challenge. A comprehensive evaluation benchmark for video generation is …
When does Sora show: The beginning of TAO to imaginative intelligence and scenarios engineering
During our discussion at workshops for writing “What Does ChatGPT Say: The DAO from
Algorithmic Intelligence to Linguistic Intelligence”[1], we had expected the next milestone for …
TC4D: Trajectory-conditioned text-to-4D generation
Recent techniques for text-to-4D generation synthesize dynamic 3D scenes using
supervision from pre-trained text-to-video models. However, existing representations, such …
LEO: Generative latent image animator for human video synthesis
Spatio-temporal coherency is a major challenge in synthesizing high quality videos,
particularly in synthesizing human videos that contain rich global and local deformations. To …
Advances in 3D generation: A survey
Generating 3D models lies at the core of computer graphics and has been the focus of
decades of research. With the emergence of advanced neural representations and …
AnyV2V: A plug-and-play framework for any video-to-video editing tasks
Video-to-video editing involves editing a source video along with additional control (such as
text prompts, subjects, or styles) to generate a new video that aligns with the source video …
MimicMotion: High-quality human motion video generation with confidence-aware pose guidance
In recent years, generative artificial intelligence has achieved significant advancements in
the field of image generation, spawning a variety of applications. However, video generation …
MiraData: A large-scale video dataset with long durations and structured captions
Sora's high-motion intensity and long consistent videos have significantly impacted the field
of video generation, attracting unprecedented attention. However, existing publicly available …
MotionBooth: Motion-aware customized text-to-video generation
In this work, we present MotionBooth, an innovative framework designed for animating
customized subjects with precise control over both object and camera movements. By …