Rtmdet: An empirical study of designing real-time object detectors

C Lyu, W Zhang, H Huang, Y Zhou, Y Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper, we aim to design an efficient real-time object detector that exceeds the YOLO
series and is easily extensible for many object recognition tasks such as instance …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

Learning equivariant segmentation with instance-unique querying

W Wang, J Liang, D Liu - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Prevalent state-of-the-art instance segmentation methods fall into a query-based scheme, in
which instance masks are derived by querying the image feature using a set of instance …

A comprehensive review of modern object segmentation approaches

Y Wang, U Ahsan, H Li, M Hagen - Foundations and Trends® …, 2022 - nowpublishers.com
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …

Superpoint transformer for 3d scene instance segmentation

J Sun, C Qing, J Tan, X Xu - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
Most existing methods realize 3D instance segmentation by extending those models used
for 3D object detection or 3D semantic segmentation. However, these non-straightforward …

Clustseg: Clustering for universal segmentation

J Liang, T Zhou, D Liu, W Wang - arXiv preprint arXiv:2305.02187, 2023 - arxiv.org
We present CLUSTSEG, a general, transformer-based framework that tackles different
image segmentation tasks (ie, superpixel, semantic, instance, and panoptic) through a …

You only segment once: Towards real-time panoptic segmentation

J Hu, L Huang, T Ren, S Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose YOSO, a real-time panoptic segmentation framework. YOSO
predicts masks via dynamic convolutions between panoptic kernels and image feature …

Clusterfomer: clustering as a universal visual learner

J Liang, Y Cui, Q Wang, T Geng… - Advances in neural …, 2024 - proceedings.neurips.cc
This paper presents ClusterFormer, a universal vision model that is based on the Clustering
paradigm with TransFormer. It comprises two novel designs: 1) recurrent cross-attention …

Fastinst: A simple query-based model for real-time instance segmentation

J He, P Li, Y Geng, X Xie - … of the IEEE/CVF conference on …, 2023 - openaccess.thecvf.com
Recent attention in instance segmentation has focused on query-based models. Despite
being non-maximum suppression (NMS)-free and end-to-end, the superiority of these …

Latr: 3d lane detection from monocular images with transformer

Y Luo, C Zheng, X Yan, T Kun… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract 3D lane detection from monocular images is a fundamental yet challenging task in
autonomous driving. Recent advances primarily rely on structural 3D surrogates (eg, bird's …