Drag-a-video: Non-rigid video editing with point-based interaction

W Sun, RC Tu, J Liao, D Tao - arXiv preprint arXiv:2407.07111, 2024 - arxiv.org

The rapid development of diffusion models (DMs) has significantly advanced image and
video applications, making" what you want is what you see" a reality. Among these, video …

被引用次数：8 相关文章所有 2 个版本

[PDF] arxiv.org

Dge: Direct gaussian 3d editing by consistent multi-view editing

M Chen, I Laina, A Vedaldi - European Conference on Computer Vision, 2025 - Springer

We consider the problem of editing 3D objects and scenes based on open-ended language
instructions. A common approach to this problem is to use a 2D image generator or editor to …

被引用次数：11 相关文章所有 2 个版本

[PDF] arxiv.org

Dragapart: Learning a part-level motion prior for articulated objects

R Li, C Zheng, C Rupprecht, A Vedaldi - European Conference on …, 2025 - Springer

We introduce DragAPart, a method that, given an image and a set of drags as input,
generates a new image of the same object that responds to the action of the drags …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

ObjCtrl-2.5 D: Training-free Object Control with Camera Poses

Z Wang, Y Lan, S Zhou, CC Loy - arXiv preprint arXiv:2412.07721, 2024 - arxiv.org

This study aims to achieve more precise and versatile object control in image-to-video (I2V)
generation. Current methods typically represent the spatial movement of target objects with …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Puppet-master: Scaling interactive video generation as a motion prior for part-level dynamics

R Li, C Zheng, C Rupprecht, A Vedaldi - arXiv preprint arXiv:2408.04631, 2024 - arxiv.org

We present Puppet-Master, an interactive video generative model that can serve as a motion
prior for part-level dynamics. At test time, given a single image and a sparse set of motion …

被引用次数：1 相关文章所有 2 个版本

[PDF] researchgate.net

[PDF][PDF] Conditional Video Generation Guided by Multimodal Inputs: A Comprehensive Survey

K Niu, W Liu, N Sharif, D Zhu - 2024 - researchgate.net

The field of video generation is rapidly evolving, driven by advancements in generative
models. This survey provides a comprehensive analysis of the diverse methodologies …

被引用次数：1 相关文章所有 2 个版本

[PDF] amazon.science

Zero-shot controllable image-to-video animation via motion decomposition

S Yu, JZ Fang, J Zheng, G Sigurdsson… - Proceedings of the …, 2024 - dl.acm.org

In this paper, we introduce a new challenging task called Zero-Shot Controllable Image-to-
Video Animation, where the goal is to animate an image based on motion trajectories …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Flatten: optical flow-guided attention for consistent text-to-video editing

Y Cong, M Xu, C Simon, S Chen, J Ren, Y Xie… - arXiv preprint arXiv …, 2023 - arxiv.org

Text-to-video editing aims to edit the visual appearance of a source video conditional on
textual prompts. A major challenge in this task is to ensure that all frames in the edited video …

被引用次数：55 相关文章所有 7 个版本

[PDF] arxiv.org

Trajectory Attention for Fine-grained Video Motion Control

Z Xiao, W Ouyang, Y Zhou, S Yang, L Yang, J Si… - arXiv preprint arXiv …, 2024 - arxiv.org

Recent advancements in video generation have been greatly driven by video diffusion
models, with camera motion control emerging as a crucial challenge in creating view …

COMD: Training-free Video Motion Transfer With Camera-Object Motion Disentanglement

T Hu, J Zhang, R Yi, Y Wang, J Weng… - Proceedings of the …, 2024 - dl.acm.org

The emergence of diffusion models has greatly propelled the progress in image and video
generation. Recently, some efforts have been made in controllable video generation …