Asymmetric cross-guided attention network for actor and action video segmentation from natural...

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

被引用次数：114 相关文章所有 7 个版本

[PDF] thecvf.com

MeViS: A large-scale benchmark for video segmentation with motion expressions

H Ding, C Liu, S He, X Jiang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

This paper strives for motion expressions guided video segmentation, which focuses on
segmenting objects in video content based on a sentence describing the motion of the …

被引用次数：71 相关文章所有 6 个版本

[PDF] thecvf.com

Language as queries for referring video object segmentation

J Wu, Y Jiang, P Sun, Z Yuan… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …

被引用次数：158 相关文章所有 7 个版本

[PDF] thecvf.com

End-to-end referring video object segmentation with multimodal transformers

A Botach, E Zheltonozhskii… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

The referring video object segmentation task (RVOS) involves segmentation of a text-
referred object instance in the frames of a given video. Due to the complex nature of this …

被引用次数：152 相关文章所有 5 个版本

[PDF] thecvf.com

Onlinerefer: A simple online baseline for referring video object segmentation

D Wu, T Wang, Y Zhang, X Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Referring video object segmentation (RVOS) aims at segmenting an object in a video
following human instruction. Current state-of-the-art methods fall into an offline pattern, in …

被引用次数：41 相关文章所有 5 个版本

[PDF] thecvf.com

Spectrum-guided multi-granularity referring video object segmentation

B Miao, M Bennamoun, Y Gao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Current referring video object segmentation (R-VOS) techniques extract conditional kernels
from encoded (low-resolution) vision-language features to segment the decoded high …

被引用次数：40 相关文章所有 7 个版本

[PDF] arxiv.org

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

被引用次数：159 相关文章所有 9 个版本

[PDF] thecvf.com

Html: Hybrid temporal-scale multimodal learning framework for referring video object segmentation

M Han, Y Wang, Z Li, L Yao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Referring Video Object Segmentation (RVOS) is to segment the object instance
from a given video, according to the textual description of this object. However, in the open …

被引用次数：23 相关文章所有 6 个版本

[PDF] thecvf.com

Temporal collection and distribution for referring video object segmentation

J Tang, G Zheng, S Yang - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Referring video object segmentation aims to segment a referent throughout a video
sequence according to a natural language expression. It requires aligning the natural …

被引用次数：19 相关文章所有 5 个版本

[PDF] github.io

Urvos: Unified referring video object segmentation network with a large-scale benchmark

S Seo, JY Lee, B Han - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer

We propose a unified referring video object segmentation network (URVOS). URVOS takes
a video and a referring expression as inputs, and estimates the object masks referred by the …

被引用次数：184 相关文章所有 5 个版本