Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

A survey on open-vocabulary detection and segmentation: Past, present, and future

C Zhu, L Chen - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in deep learning era. Due to the expensive manual …

Visa: Reasoning video object segmentation via large language models

C Yan, H Wang, S Yan, X Jiang, Y Hu, G Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing Video Object Segmentation (VOS) relies on explicit user instructions, such as
categories, masks, or short phrases, restricting their ability to perform complex video …

3rd place solution for pvuw challenge 2023: Video panoptic segmentation

J Su, W Yang, J Luo, X Wei - arXiv preprint arXiv:2306.06753, 2023 - arxiv.org
In order to deal with the task of video panoptic segmentation in the wild, we propose a robust
integrated video panoptic segmentation solution. In our solution, we regard the video …

Context-Aware Video Instance Segmentation

S Lee, J Seo, K Han, M Choi, S Im - arXiv preprint arXiv:2407.03010, 2024 - arxiv.org
In this paper, we introduce the Context-Aware Video Instance Segmentation (CAVIS), a
novel framework designed to enhance instance association by integrating contextual …

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Y Zhou, T Zhang, S Ji, S Yan, X Li - arXiv preprint arXiv:2404.00086, 2024 - arxiv.org
Modern video segmentation methods adopt object queries to perform inter-frame
association and demonstrate satisfactory performance in tracking continuously appearing …

Rethinking Video Segmentation with Masked Video Consistency: Did the Model Learn as Intended?

C Liang, Q Guo, X Qu, L Liu, T Liu - arXiv preprint arXiv:2408.10627, 2024 - arxiv.org
Video segmentation aims at partitioning video sequences into meaningful segments based
on objects or regions of interest within frames. Current video segmentation models are often …

1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video …

Q Liu, M El-Khamy, KB Song - arXiv preprint arXiv:2406.05352, 2024 - arxiv.org
The third Pixel-level Video Understanding in the Wild (PVUW CVPR 2024) challenge aims
to advance the state of art in video understanding through benchmarking Video Panoptic …