Combined object categorization and segmentation with an implicit shape model

N Buch, SA Velastin, J Orwell - IEEE Transactions on intelligent …, 2011 - ieeexplore.ieee.org

Automatic video analysis from urban surveillance cameras is a fast-emerging field based on
computer vision techniques. We present here a comprehensive review of the state-of-the-art …

被引用次数：837 相关文章所有 9 个版本

[PDF] nowpublishers.com

Semantic image segmentation: Two decades of research

G Csurka, R Volpi, B Chidlovskii - Foundations and Trends® …, 2022 - nowpublishers.com

Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer
vision applications, providing key information for the global understanding of an image. This …

被引用次数：44 相关文章所有 7 个版本

[PDF] thecvf.com

Max-deeplab: End-to-end panoptic segmentation with mask transformers

H Wang, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract We present MaX-DeepLab, the first end-to-end model for panoptic segmentation.
Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks …

被引用次数：606 相关文章所有 9 个版本

[PDF] arxiv.org

Axial-deeplab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer

Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

被引用次数：863 相关文章所有 9 个版本

[PDF] thecvf.com

Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com

In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

被引用次数：719 相关文章所有 8 个版本

[PDF] thecvf.com

Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …

被引用次数：100 相关文章所有 6 个版本

[PDF] thecvf.com

Deep hough voting for 3d object detection in point clouds

CR Qi, O Litany, K He… - proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Current 3D object detection methods are heavily influenced by 2D detectors. In order to
leverage architectures in 2D detectors, they often convert 3D point clouds to regular grids …

被引用次数：1470 相关文章所有 13 个版本

[PDF] arxiv.org

Posecnn: A convolutional neural network for 6d object pose estimation in cluttered scenes

Y Xiang, T Schmidt, V Narayanan, D Fox - arXiv preprint arXiv:1711.00199, 2017 - arxiv.org

Estimating the 6D pose of known objects is important for robots to interact with the real
world. The problem is challenging due to the variety of objects as well as the complexity of a …

被引用次数：2256 相关文章所有 12 个版本

[PDF] thecvf.com

Synthetic data for text localisation in natural images

A Gupta, A Vedaldi… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com

In this paper we introduce a new method for text detection in natural images. The method
comprises two contributions: First, a fast and scalable engine to generate synthetic images …

被引用次数：1842 相关文章所有 14 个版本

[PDF] thecvf.com

Vip-deeplab: Learning visual perception with depth-aware video panoptic segmentation

S Qiao, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com

In this paper, we present ViP-DeepLab, a unified model attempting to tackle the long-
standing and challenging inverse projection problem in vision, which we model as restoring …

被引用次数：164 相关文章所有 6 个版本