Lvos: A benchmark for long-term video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com

We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …

被引用次数：32 相关文章所有 4 个版本

[PDF] arxiv.org

Sam 2: Segment anything in images and videos

N Ravi, V Gabeur, YT Hu, R Hu, C Ryali, T Ma… - arXiv preprint arXiv …, 2024 - arxiv.org

We present Segment Anything Model 2 (SAM 2), a foundation model towards solving
promptable visual segmentation in images and videos. We build a data engine, which …

被引用次数：24 相关文章所有 2 个版本

[PDF] thecvf.com

Segment every reference object in spatial and temporal spaces

J Wu, Y Jiang, B Yan, H Lu… - Proceedings of the …, 2023 - openaccess.thecvf.com

The reference-based object segmentation tasks, namely referring image segmentation
(RIS), referring video object segmentation (RVOS), and video object segmentation (VOS) …

被引用次数：9 相关文章所有 3 个版本

[PDF] thecvf.com

Xmem++: Production-level video segmentation from few annotated frames

M Bekuzarov, A Bermudez… - Proceedings of the …, 2023 - openaccess.thecvf.com

Despite advancements in user-guided video segmentation, extracting complex objects
consistently for highly complex scenes is still a labor-intensive task, especially for …

被引用次数：17 相关文章所有 6 个版本

[PDF] thecvf.com

Onetracker: Unifying visual object tracking with foundation models and efficient tuning

L Hong, S Yan, R Zhang, W Li, X Zhou… - Proceedings of the …, 2024 - openaccess.thecvf.com

Visual object tracking aims to localize the target object of each frame based on its initial
appearance in the first frame. Depending on the input modility tracking tasks can be divided …

被引用次数：11 相关文章所有 3 个版本

[PDF] thecvf.com

Point-VOS: Pointing Up Video Object Segmentation

S Mahadevan, IE Zulfikar… - Proceedings of the …, 2024 - openaccess.thecvf.com

Current state-of-the-art Video Object Segmentation (VOS) methods rely on dense per-object
mask annotations both during training and testing. This requires time-consuming and costly …

[PDF] arxiv.org

Openvis: Open-vocabulary video instance segmentation

P Guo, T Huang, P He, X Liu, T Xiao, Z Chen… - arXiv preprint arXiv …, 2023 - arxiv.org

Open-vocabulary Video Instance Segmentation (OpenVIS) can simultaneously detect,
segment, and track arbitrary object categories in a video, without being constrained to …

被引用次数：14 相关文章所有 2 个版本

Simulflow: Simultaneously extracting feature and identifying target for unsupervised video object segmentation

L Hong, W Zhang, S Gao, H Lu, WQ Zhang - Proceedings of the 31st …, 2023 - dl.acm.org

Unsupervised video object segmentation (UVOS) aims at detecting the primary objects in a
given video sequence without any human interposing. Most existing methods rely on two …

被引用次数：5 相关文章所有 3 个版本

[PDF] thecvf.com

Learning to Segment Referred Objects from Narrated Egocentric Videos

Y Shen, H Wang, X Yang, M Feiszli… - Proceedings of the …, 2024 - openaccess.thecvf.com

Egocentric videos provide a first-person perspective of the wearer's activities involving
simultaneous interactions with multiple objects. In this work we propose the task of weakly …

[PDF] thecvf.com

RMem: Restricted Memory Banks Improve Video Object Segmentation

J Zhou, Z Pang, YX Wang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

With recent video object segmentation (VOS) benchmarks evolving to challenging scenarios
we revisit a simple but overlooked strategy: restricting the size of memory banks. This …