Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Mesh r-cnn

G Gkioxari, J Malik, J Johnson - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Rapid advances in 2D perception have led to systems that accurately detect objects in real-
world images. However, these systems make predictions in 2D, ignoring the 3D structure of …

Dust3r: Geometric 3d vision made easy

S Wang, V Leroy, Y Cabon… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-view stereo reconstruction (MVS) in the wild requires to first estimate the camera
intrinsic and extrinsic parameters. These are usually tedious and cumbersome to obtain yet …

Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age

C Cadena, L Carlone, H Carrillo, Y Latif… - IEEE Transactions …, 2016 - ieeexplore.ieee.org
Simultaneous localization and mapping (SLAM) consists in the concurrent construction of a
model of the environment (the map), and the estimation of the state of the robot moving …

Cubeslam: Monocular 3-d object slam

S Yang, S Scherer - IEEE Transactions on Robotics, 2019 - ieeexplore.ieee.org
In this paper, we present a method for single image three-dimensional (3-D) cuboid object
detection and multiview object simultaneous localization and mapping in both static and …

3d-r2n2: A unified approach for single and multi-view 3d object reconstruction

CB Choy, D Xu, JY Gwak, K Chen… - Computer Vision–ECCV …, 2016 - Springer
Inspired by the recent success of methods that employ shape priors to achieve robust 3D
reconstructions, we propose a novel recurrent neural network architecture that we call the …

Implicit surface representations as layers in neural networks

M Michalkiewicz, JK Pontes, D Jack… - Proceedings of the …, 2019 - openaccess.thecvf.com
Implicit shape representations, such as Level Sets, provide a very elegant formulation for
performing computations involving curves and surfaces. However, including implicit …

Semantics for robotic mapping, perception and interaction: A survey

S Garg, N Sünderhauf, F Dayoub… - … and Trends® in …, 2020 - nowpublishers.com
For robots to navigate and interact more richly with the world around them, they will likely
require a deeper understanding of the world in which they operate. In robotics and related …

Learning a multi-view stereo machine

A Kar, C Häne, J Malik - Advances in neural information …, 2017 - proceedings.neurips.cc
We present a learnt system for multi-view stereopsis. In contrast to recent learning based
methods for 3D reconstruction, we leverage the underlying 3D geometry of the problem …

Hierarchical surface prediction for 3d object reconstruction

C Häne, S Tulsiani, J Malik - 2017 International Conference on …, 2017 - ieeexplore.ieee.org
Recently, Convolutional Neural Networks have shown promising results for 3D geometry
prediction. They can make predictions from very little input data such as a single color …