A review of computer vision techniques for the analysis of urban traffic

N Buch, SA Velastin, J Orwell - IEEE Transactions on intelligent …, 2011 - ieeexplore.ieee.org
Automatic video analysis from urban surveillance cameras is a fast-emerging field based on
computer vision techniques. We present here a comprehensive review of the state-of-the-art …

Semantic image segmentation: Two decades of research

G Csurka, R Volpi, B Chidlovskii - Foundations and Trends® …, 2022 - nowpublishers.com
Semantic image segmentation (SiS) plays a fundamental role in a broad variety of computer
vision applications, providing key information for the global understanding of an image. This …

Max-deeplab: End-to-end panoptic segmentation with mask transformers

H Wang, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present MaX-DeepLab, the first end-to-end model for panoptic segmentation.
Our approach simplifies the current pipeline that depends heavily on surrogate sub-tasks …

Axial-deeplab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

Cmt-deeplab: Clustering mask transformers for panoptic segmentation

Q Yu, H Wang, D Kim, S Qiao… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based
framework for panoptic segmentation designed around clustering. It rethinks the existing …

Deep hough voting for 3d object detection in point clouds

CR Qi, O Litany, K He… - proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Current 3D object detection methods are heavily influenced by 2D detectors. In order to
leverage architectures in 2D detectors, they often convert 3D point clouds to regular grids …

Posecnn: A convolutional neural network for 6d object pose estimation in cluttered scenes

Y Xiang, T Schmidt, V Narayanan, D Fox - arXiv preprint arXiv:1711.00199, 2017 - arxiv.org
Estimating the 6D pose of known objects is important for robots to interact with the real
world. The problem is challenging due to the variety of objects as well as the complexity of a …

Synthetic data for text localisation in natural images

A Gupta, A Vedaldi… - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
In this paper we introduce a new method for text detection in natural images. The method
comprises two contributions: First, a fast and scalable engine to generate synthetic images …

Vip-deeplab: Learning visual perception with depth-aware video panoptic segmentation

S Qiao, Y Zhu, H Adam, A Yuille… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we present ViP-DeepLab, a unified model attempting to tackle the long-
standing and challenging inverse projection problem in vision, which we model as restoring …