Joint semantic segmentation and 3D reconstruction from monocular video

Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com

Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Cited by 1131 · Related articles · All 9 versions

A comprehensive review of deep learning-based crack detection approaches

Y Hamishebahar, H Guan, S So, J Jo - Applied Sciences, 2022 - mdpi.com

The application of deep architectures inspired by the fields of artificial intelligence and
computer vision has made a significant impact on the task of crack detection. As the number …

Cited by 100 · Related articles · All 7 versions

Panoptic neural fields: A semantic object-aware neural scene representation

A Kundu, K Genova, X Yin, A Fathi… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present Panoptic Neural Fields (PNF), an object-aware neural scene representation that decomposes
a scene into a set of objects (things) and background (stuff). Each object is represented by a …

Cited by 280 · Related articles · All 8 versions

Navigating to objects in the real world

T Gervet, S Chintala, D Batra, J Malik, DS Chaplot - Science Robotics, 2023 - science.org

Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …

Cited by 102 · Related articles · All 8 versions

Nerflets: Local radiance fields for efficient structure-aware 3D scene representation from 2D supervision

X Zhang, A Kundu, T Funkhouser… - Proceedings of the …, 2023 - openaccess.thecvf.com

We address efficient and structure-aware 3D scene representation from images. Nerflets are
our key contribution: a set of local neural radiance fields that together represent a scene …

Cited by 47 · Related articles · All 7 versions

The ApolloScape dataset for autonomous driving

X Huang, X Cheng, Q Geng, B Cao… - Proceedings of the …, 2018 - openaccess.thecvf.com

Scene parsing aims to assign a class (semantic) label for each pixel in an image. It is a
comprehensive analysis of an image. Given the rise of autonomous driving, pixel-accurate …

Cited by 1334 · Related articles · All 20 versions

ESPNet: Efficient spatial pyramid of dilated convolutions for semantic segmentation

S Mehta, M Rastegari, A Caspi… - Proceedings of the …, 2018 - openaccess.thecvf.com

We introduce a fast and efficient convolutional neural network, ESPNet, for semantic
segmentation of high resolution images under resource constraints. ESPNet is based on a …

Cited by 1051 · Related articles · All 17 versions

Tangent convolutions for dense prediction in 3D

M Tatarchenko, J Park, V Koltun… - Proceedings of the …, 2018 - openaccess.thecvf.com

We present an approach to semantic scene analysis using deep convolutional networks.
Our approach is based on tangent convolutions, a new construction for convolutional …

Cited by 674 · Related articles · All 18 versions

Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age

C Cadena, L Carlone, H Carrillo, Y Latif… - IEEE Transactions …, 2016 - ieeexplore.ieee.org

Simultaneous localization and mapping (SLAM) consists in the concurrent construction of a
model of the environment (the map), and the estimation of the state of the robot moving …

Cited by 4350 · Related articles · All 22 versions

The Cityscapes dataset for semantic urban scene understanding

M Cordts, M Omran, S Ramos… - Proceedings of the …, 2016 - openaccess.thecvf.com

Visual understanding of complex urban street scenes is an enabling factor for a wide range
of applications. Object detection has benefited enormously from large-scale datasets …

Cited by 14418 · Related articles · All 21 versions