Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Pedestrian detection: An evaluation of the state of the art

P Dollar, C Wojek, B Schiele… - IEEE transactions on …, 2011 - ieeexplore.ieee.org
Pedestrian detection is a key problem in computer vision, with several applications that have
the potential to positively impact quality of life. In recent years, the number of approaches to …

Deep hough voting for 3d object detection in point clouds

CR Qi, O Litany, K He… - proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Current 3D object detection methods are heavily influenced by 2D detectors. In order to
leverage architectures in 2D detectors, they often convert 3D point clouds to regular grids …

Mask3d: Mask transformer for 3d semantic instance segmentation

J Schult, F Engelmann, A Hermans… - … on Robotics and …, 2023 - ieeexplore.ieee.org
Modern 3D semantic instance segmentation approaches predominantly rely on specialized
voting mechanisms followed by carefully designed geometric clustering techniques. Building …

Multi-task learning using uncertainty to weigh losses for scene geometry and semantics

A Kendall, Y Gal, R Cipolla - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Numerous deep learning applications benefit from multi-task learning with multiple
regression and classification objectives. In this paper we make the observation that the …

Ovtrack: Open-vocabulary multiple object tracking

S Li, T Fischer, L Ke, H Ding… - Proceedings of the …, 2023 - openaccess.thecvf.com
The ability to recognize, localize and track dynamic objects in a scene is fundamental to
many real-world applications, such as self-driving and robotic systems. Yet, traditional …

The cityscapes dataset for semantic urban scene understanding

M Cordts, M Omran, S Ramos… - Proceedings of the …, 2016 - openaccess.thecvf.com
Visual understanding of complex urban street scenes is an enabling factor for a wide range
of applications. Object detection has benefited enormously from large-scale datasets …

Occuseg: Occupancy-aware 3d instance segmentation

L Han, T Zheng, L Xu, L Fang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Abstract 3D instance segmentation, with a variety of applications in robotics and augmented
reality, is in large demands these days. Unlike 2D images that are projective observations of …

Convolutional neural network architecture for geometric matching

I Rocco, R Arandjelovic, J Sivic - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We address the problem of determining correspondences between two images in
agreement with a geometric model such as an affine or thin-plate spline transformation, and …

Each part matters: Local patterns facilitate cross-view geo-localization

T Wang, Z Zheng, C Yan, J Zhang… - … on Circuits and …, 2021 - ieeexplore.ieee.org
Cross-view geo-localization is to spot images of the same geographic target from different
platforms, eg, drone-view cameras and satellites. It is challenging in the large visual …