Robust object detection with interleaved categorization and segmentation

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com

Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Cited by 1131 · Related articles · All 9 versions

Pedestrian detection: An evaluation of the state of the art

P Dollar, C Wojek, B Schiele… - IEEE transactions on …, 2011 - ieeexplore.ieee.org

Pedestrian detection is a key problem in computer vision, with several applications that have
the potential to positively impact quality of life. In recent years, the number of approaches to …

Cited by 4158 · Related articles · All 24 versions

Deep hough voting for 3d object detection in point clouds

CR Qi, O Litany, K He… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Current 3D object detection methods are heavily influenced by 2D detectors. In order to
leverage architectures in 2D detectors, they often convert 3D point clouds to regular grids …

Cited by 1470 · Related articles · All 13 versions

Mask3d: Mask transformer for 3d semantic instance segmentation

J Schult, F Engelmann, A Hermans… - … on Robotics and …, 2023 - ieeexplore.ieee.org

Modern 3D semantic instance segmentation approaches predominantly rely on specialized
voting mechanisms followed by carefully designed geometric clustering techniques. Building …

Cited by 149 · Related articles · All 5 versions

Multi-task learning using uncertainty to weigh losses for scene geometry and semantics

A Kendall, Y Gal, R Cipolla - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com

Numerous deep learning applications benefit from multi-task learning with multiple
regression and classification objectives. In this paper we make the observation that the …

Cited by 3747 · Related articles · All 14 versions

Ovtrack: Open-vocabulary multiple object tracking

S Li, T Fischer, L Ke, H Ding… - Proceedings of the …, 2023 - openaccess.thecvf.com

The ability to recognize, localize and track dynamic objects in a scene is fundamental to
many real-world applications, such as self-driving and robotic systems. Yet, traditional …

Cited by 53 · Related articles · All 7 versions

The cityscapes dataset for semantic urban scene understanding

M Cordts, M Omran, S Ramos… - Proceedings of the …, 2016 - openaccess.thecvf.com

Visual understanding of complex urban street scenes is an enabling factor for a wide range
of applications. Object detection has benefited enormously from large-scale datasets …

Cited by 14418 · Related articles · All 21 versions

Occuseg: Occupancy-aware 3d instance segmentation

L Han, T Zheng, L Xu, L Fang - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com

3D instance segmentation, with a variety of applications in robotics and augmented
reality, is in large demands these days. Unlike 2D images that are projective observations of …

Cited by 285 · Related articles · All 9 versions

Convolutional neural network architecture for geometric matching

I Rocco, R Arandjelovic, J Sivic - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We address the problem of determining correspondences between two images in
agreement with a geometric model such as an affine or thin-plate spline transformation, and …

Cited by 666 · Related articles · All 17 versions

Each part matters: Local patterns facilitate cross-view geo-localization

T Wang, Z Zheng, C Yan, J Zhang… - … on Circuits and …, 2021 - ieeexplore.ieee.org

Cross-view geo-localization is to spot images of the same geographic target from different
platforms, e.g., drone-view cameras and satellites. It is challenging in the large visual …

Cited by 180 · Related articles · All 6 versions