Computer vision for autonomous vehicles: Problems, datasets and state of the art

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com
Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

Categorical depth distribution network for monocular 3D object detection

C Reading, A Harakeh, J Chae… - Proceedings of the …, 2021 - openaccess.thecvf.com
Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a
solution with simple configuration compared to typical multi-sensor systems. The main …

Translating images into maps

A Saha, O Mendez, C Russell… - … conference on robotics …, 2022 - ieeexplore.ieee.org
We approach instantaneous mapping, converting images to a top-down view of the world, as
a translation problem. We show how a novel form of transformer network can be used to …

Cross-view semantic segmentation for sensing surroundings

B Pan, J Sun, HYT Leung, A Andonian… - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Sensing surroundings plays a crucial role in human spatial perception, as it extracts the
spatial configuration of objects as well as the free space from the observations. To facilitate …

Projecting your view attentively: Monocular road scene layout estimation via cross-view transformation

W Yang, Q Li, W Liu, Y Yu, Y Ma… - Proceedings of the …, 2021 - openaccess.thecvf.com
HD map reconstruction is crucial for autonomous driving. LiDAR-based methods are limited
due to the deployed expensive sensors and time-consuming computation. Camera-based …

Enabling spatio-temporal aggregation in birds-eye-view vehicle estimation

A Saha, O Mendez, C Russell… - 2021 IEEE international …, 2021 - ieeexplore.ieee.org
Constructing Birds-Eye-View (BEV) maps from monocular images is typically a complex
multi-stage process involving the separate vision tasks of ground plane estimation, road …

Sidewalk extraction using aerial and street view images

H Ning, X Ye, Z Chen, T Liu… - Environment and Planning …, 2022 - journals.sagepub.com
A reliable, punctual, and spatially accurate dataset of sidewalks is vital for identifying where
improvements can be made upon urban environment to enhance multi-modal accessibility …

MonoLayout: Amodal scene layout from a single image

K Mani, S Daga, S Garg… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this paper, we address the novel, highly challenging problem of estimating the layout of a
complex urban driving scenario. Given a single color image captured from a driving platform …

Ordered atomic activity for fine-grained interactive traffic scenario understanding

N Agarwal, YT Chen - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
We introduce a novel representation called Ordered Atomic Activity for interactive scenario
understanding. The representation decomposes each scenario into a set of ordered atomic …

Vision-based uneven BEV representation learning with polar rasterization and surface estimation

Z Liu, S Chen, X Guo, X Wang… - … on Robot Learning, 2023 - proceedings.mlr.press
In this work, we propose PolarBEV for vision-based uneven BEV representation learning. To
adapt to the foreshortening effect of camera imaging, we rasterize the BEV space both …