MOSE: A new dataset for video object segmentation in complex scenes
Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
MeViS: A large-scale benchmark for video segmentation with motion expressions
This paper strives for motion expressions guided video segmentation, which focuses on
segmenting objects in video content based on a sentence describing the motion of the …
segmenting objects in video content based on a sentence describing the motion of the …
Language as queries for referring video object segmentation
Referring video object segmentation (R-VOS) is an emerging cross-modal task that aims to
segment the target object referred by a language expression in all video frames. In this work …
segment the target object referred by a language expression in all video frames. In this work …
End-to-end referring video object segmentation with multimodal transformers
A Botach, E Zheltonozhskii… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
The referring video object segmentation task (RVOS) involves segmentation of a text-
referred object instance in the frames of a given video. Due to the complex nature of this …
referred object instance in the frames of a given video. Due to the complex nature of this …
Onlinerefer: A simple online baseline for referring video object segmentation
Referring video object segmentation (RVOS) aims at segmenting an object in a video
following human instruction. Current state-of-the-art methods fall into an offline pattern, in …
following human instruction. Current state-of-the-art methods fall into an offline pattern, in …
Spectrum-guided multi-granularity referring video object segmentation
Current referring video object segmentation (R-VOS) techniques extract conditional kernels
from encoded (low-resolution) vision-language features to segment the decoded high …
from encoded (low-resolution) vision-language features to segment the decoded high …
A survey on deep learning technique for video segmentation
Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …
critical role in a broad range of practical applications, from enhancing visual effects in movie …
Html: Hybrid temporal-scale multimodal learning framework for referring video object segmentation
Abstract Referring Video Object Segmentation (RVOS) is to segment the object instance
from a given video, according to the textual description of this object. However, in the open …
from a given video, according to the textual description of this object. However, in the open …
Temporal collection and distribution for referring video object segmentation
Referring video object segmentation aims to segment a referent throughout a video
sequence according to a natural language expression. It requires aligning the natural …
sequence according to a natural language expression. It requires aligning the natural …
Urvos: Unified referring video object segmentation network with a large-scale benchmark
We propose a unified referring video object segmentation network (URVOS). URVOS takes
a video and a referring expression as inputs, and estimates the object masks referred by the …
a video and a referring expression as inputs, and estimates the object masks referred by the …