Putting the object back into video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …

Sam 2: Segment anything in images and videos

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

Segment every reference object in spatial and temporal spaces

J Wu, Y Jiang, B Yan, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com
The reference-based object segmentation tasks, namely referring image segmentation
(RIS), referring video object segmentation (RVOS), and video object segmentation (VOS) …

Xmem++: Production-level video segmentation from few annotated frames

M Bekuzarov, A Bermudez… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite advancements in user-guided video segmentation, extracting complex objects
consistently for highly complex scenes is still a labor-intensive task, especially for …

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modility tracking tasks can be divided …

Point-VOS: Pointing Up Video Object Segmentation

S Mahadevan, IE Zulfikar… - Proceedings of the …, 2024 - openaccess.thecvf.com
Current state-of-the-art Video Object Segmentation (VOS) methods rely on dense per-object
mask annotations both during training and testing. This requires time-consuming and costly …

Openvis: Open-vocabulary video instance segmentation

P Guo, T Huang, P He, X Liu, T Xiao, Z Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Open-vocabulary Video Instance Segmentation (OpenVIS) can simultaneously detect,
segment, and track arbitrary object categories in a video, without being constrained to …

Simulflow: Simultaneously extracting feature and identifying target for unsupervised video object segmentation

L Hong, W Zhang, S Gao, H Lu, WQ Zhang - Proceedings of the 31st …, 2023 - dl.acm.org
Unsupervised video object segmentation (UVOS) aims at detecting the primary objects in a
given video sequence without any human interposing. Most existing methods rely on two …

Learning to Segment Referred Objects from Narrated Egocentric Videos

Y Shen, H Wang, X Yang, M Feiszli… - Proceedings of the …, 2024 - openaccess.thecvf.com
Egocentric videos provide a first-person perspective of the wearer's activities involving
simultaneous interactions with multiple objects. In this work we propose the task of weakly …

RMem: Restricted Memory Banks Improve Video Object Segmentation

J Zhou, Z Pang, YX Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
With recent video object segmentation (VOS) benchmarks evolving to challenging scenarios
we revisit a simple but overlooked strategy: restricting the size of memory banks. This …