Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2024 - proceedings.neurips.cc

Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

被引用次数：65 相关文章所有 5 个版本

[PDF] thecvf.com

Oneformer: One transformer to rule universal image segmentation

J Jain, J Li, MT Chiu, A Hassani… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …

被引用次数：209 相关文章所有 8 个版本

[PDF] thecvf.com

Detrs with hybrid matching

D Jia, Y Yuan, H He, X Wu, H Yu… - Proceedings of the …, 2023 - openaccess.thecvf.com

One-to-one set matching is a key design for DETR to establish its end-to-end capability, so
that object detection does not require a hand-crafted NMS (non-maximum suppression) to …

被引用次数：137 相关文章所有 6 个版本

[PDF] thecvf.com

A generalist framework for panoptic segmentation of images and videos

T Chen, L Li, S Saxena, G Hinton… - Proceedings of the …, 2023 - openaccess.thecvf.com

Panoptic segmentation assigns semantic and instance ID labels to every pixel of an image.
As permutations of instance IDs are also valid solutions, the task requires learning of high …

被引用次数：76 相关文章所有 7 个版本

[PDF] ieee.org

Transformer meets remote sensing video detection and tracking: A comprehensive survey

L Jiao, X Zhang, X Liu, F Liu, S Yang… - IEEE Journal of …, 2023 - ieeexplore.ieee.org

Transformer has shown excellent performance in remote sensing field with long-range
modeling capabilities. Remote sensing video (RSV) moving object detection and tracking …

被引用次数：12 相关文章所有 2 个版本

[PDF] neurips.cc

Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc

Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

被引用次数：17 相关文章所有 5 个版本

[PDF] arxiv.org

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org

We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (ie, superpixel, semantic, instance, and panoptic) through a …

被引用次数：65 相关文章所有 5 个版本

[PDF] neurips.cc

Clusterfomer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2024 - proceedings.neurips.cc

This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

被引用次数：21 相关文章所有 5 个版本

[PDF] arxiv.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - arXiv preprint arXiv …, 2023 - arxiv.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

被引用次数：57 相关文章所有 3 个版本

[PDF] thecvf.com

Fastinst: A simple query-based model for real-time instance segmentation

J He, P Li, Y Geng, X Xie - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com

Recent attention in instance segmentation has focused on query-based models. Despite
being non-maximum suppression (NMS)-free and end-to-end, the superiority of these …

被引用次数：28 相关文章所有 6 个版本