Deep learning for visual tracking: A comprehensive survey
SM Marvasti-Zadeh, L Cheng… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Visual target tracking is one of the most sought-after yet challenging research topics in
computer vision. Given the ill-posed nature of the problem and its popularity in a broad …
computer vision. Given the ill-posed nature of the problem and its popularity in a broad …
Video object segmentation and tracking: A survey
Object segmentation and object tracking are fundamental research areas in the computer
vision community. These two topics are difficult to handle some common challenges, such …
vision community. These two topics are difficult to handle some common challenges, such …
Visual prompt multi-modal tracking
Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …
tributaries. To inherit the powerful representations of the foundation model, a natural modus …
Sam 2: Segment anything in images and videos
We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …
promptable visual segmentation in images and videos. We build a data engine, which …
MOSE: A new dataset for video object segmentation in complex scenes
Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
Towards an end-to-end framework for flow-guided video inpainting
Optical flow, which captures motion information across frames, is exploited in recent video
inpainting methods through propagating pixels along its trajectories. However, the hand …
inpainting methods through propagating pixels along its trajectories. However, the hand …
Propainter: Improving propagation and transformer for video inpainting
Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …
in video inpainting (VI). Despite the effectiveness of these components, they still suffer from …
Siam r-cnn: Visual tracking by re-detection
Abstract We present Siam R-CNN, a Siamese re-detection architecture which unleashes the
full power of two-stage object detection approaches for visual object tracking. We combine …
full power of two-stage object detection approaches for visual object tracking. We combine …
End-to-end referring video object segmentation with multimodal transformers
A Botach, E Zheltonozhskii… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
The referring video object segmentation task (RVOS) involves segmentation of a text-
referred object instance in the frames of a given video. Due to the complex nature of this …
referred object instance in the frames of a given video. Due to the complex nature of this …
Video object segmentation with episodic graph memory networks
How to make a segmentation model efficiently adapt to a specific video as well as online
target appearance variations is a fundamental issue in the field of video object …
target appearance variations is a fundamental issue in the field of video object …