Dvis++: Improved decoupled framework for universal video segmentation

T Zhang, X Tian, Y Zhou, S Ji, X Wang, X Tao… - arXiv preprint arXiv …, 2023 - arxiv.org
We present the\textbf {D} ecoupled\textbf {VI} deo\textbf {S} egmentation (DVIS) framework, a
novel approach for the challenging task of universal video segmentation, including video …

Visa: Reasoning video object segmentation via large language models

C Yan, H Wang, S Yan, X Jiang, Y Hu, G Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing Video Object Segmentation (VOS) relies on explicit user instructions, such as
categories, masks, or short phrases, restricting their ability to perform complex video …