Video diffusion models are strong video inpainter

M Lee, S Cho, C Shin, J Lee, S Yang, S Lee - arXiv preprint arXiv …, 2024 - arxiv.org
Propagation-based video inpainting using optical flow at the pixel or feature level has
recently garnered significant attention. However, it has limitations such as the inaccuracy of …

[PDF][PDF] Beyond the Field-of-View: Enhancing Scene Visibility and Perception with Clip-Recurrent Transformer

H Shi, Q Jiang, K Yang, X Yin, Z Wang… - arXiv preprint arXiv …, 2022 - researchgate.net
Limited by hardware cost and system size, camera's Field-of-View (FoV) is not always
satisfactory. However, from a spatio-temporal perspective, information beyond the camera's …

Beyond the Field-of-View: Enhancing Scene Visibility and Perception with Clip-Recurrent Transformer

H Shi, Q Jiang, K Yang, X Yin, H Ni… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Vision sensors are widely applied in vehicles, robots, and roadside infrastructure. However,
due to limitations in hardware cost and system size, camera Field-of-View (FoV) is often …

Optical-Flow Guided Prompt Optimization for Coherent Video Generation

H Nam, J Kim, D Lee, JC Ye - arXiv preprint arXiv:2411.15540, 2024 - arxiv.org
While text-to-video diffusion models have made significant strides, many still face challenges
in generating videos with temporal consistency. Within diffusion frameworks, guidance …

Replace Anyone in Videos

X Wang, C Gao, Y Wang, N Sang - arXiv preprint arXiv:2409.19911, 2024 - arxiv.org
Recent advancements in controllable human-centric video generation, particularly with the
rise of diffusion models, have demonstrated considerable progress. However, achieving …

V-LASIK: Consistent Glasses-Removal from Videos Using Synthetic Data

R Shalev-Arkushin, A Azulay, T Halperin… - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion-based generative models have recently shown remarkable image and video
editing capabilities. However, local video editing, particularly removal of small attributes like …

[PDF][PDF] 基于深度学习的图像修复方法研究进展.

陈文祥, 田启川, 廉露, 张晓行… - Journal of Computer …, 2024 - lib.zjsru.edu.cn
图像修复是通过算法或技术对受损或缺失的图像进行恢复和修复的过程, 是计算机视觉领域的
研究热点之一. 梳理了近些年基于深度学习的图像修复方法的发展脉络, 将其分为单模态图像 …

Text-Video Completion Networks With Motion Compensation And Attention Aggregation

J Wang, Z Wu, H Xuan, Y Yan - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
The purpose of video inpainting is to fill a specified area with reasonable content. However,
in the case of multiple targets and complex textures, current methods struggle to distinguish …

[PDF][PDF] VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

C Xie, K Han, KYK Wong - i.cs.hku.hk
Recent video inpainting methods have achieved encouraging improvements by leveraging
optical flow to guide pixel propagation from reference frames, either in the image space or …