Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

A comprehensive review of deep learning-based crack detection approaches

Y Hamishebahar, H Guan, S So, J Jo - Applied Sciences, 2022 - mdpi.com
The application of deep architectures inspired by the fields of artificial intelligence and
computer vision has made a significant impact on the task of crack detection. As the number …

Panoptic neural fields: A semantic object-aware neural scene representation

A Kundu, K Genova, X Yin, A Fathi… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present PanopticNeRF, an object-aware neural scene representation that decomposes
a scene into a set of objects (things) and background (stuff). Each object is represented by a …

Navigating to objects in the real world

T Gervet, S Chintala, D Batra, J Malik, DS Chaplot - Science Robotics, 2023 - science.org
Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …

Nerflets: Local radiance fields for efficient structure-aware 3d scene representation from 2d supervision

X Zhang, A Kundu, T Funkhouser… - Proceedings of the …, 2023 - openaccess.thecvf.com
We address efficient and structure-aware 3D scene representation from images. Nerflets are
our key contribution--a set of local neural radiance fields that together represent a scene …

The apolloscape dataset for autonomous driving

X Huang, X Cheng, Q Geng, B Cao… - Proceedings of the …, 2018 - openaccess.thecvf.com
Scene parsing aims to assign a class (semantic) label for each pixel in an image. It is a
comprehensive analysis of an image. Given the rise of autonomous driving, pixel-accurate …

Espnet: Efficient spatial pyramid of dilated convolutions for semantic segmentation

S Mehta, M Rastegari, A Caspi… - Proceedings of the …, 2018 - openaccess.thecvf.com
We introduce a fast and efficient convolutional neural network, ESPNet, for semantic
segmentation of high resolution images under resource constraints. ESPNet is based on a …

Tangent convolutions for dense prediction in 3d

M Tatarchenko, J Park, V Koltun… - Proceedings of the …, 2018 - openaccess.thecvf.com
We present an approach to semantic scene analysis using deep convolutional networks.
Our approach is based on tangent convolutions-a new construction for convolutional …

Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age

C Cadena, L Carlone, H Carrillo, Y Latif… - IEEE Transactions …, 2016 - ieeexplore.ieee.org
Simultaneous localization and mapping (SLAM) consists in the concurrent construction of a
model of the environment (the map), and the estimation of the state of the robot moving …

The cityscapes dataset for semantic urban scene understanding

M Cordts, M Omran, S Ramos… - Proceedings of the …, 2016 - openaccess.thecvf.com
Visual understanding of complex urban street scenes is an enabling factor for a wide range
of applications. Object detection has benefited enormously from large-scale datasets …