Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …
and drawing extensive attention both from industry and academia. Conventional …
Robustness-aware 3d object detection in autonomous driving: A review and outlook
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …
accurately assessing the state of the surrounding environment, thereby enabling informed …
Dsvt: Dynamic sparse voxel transformer with rotated sets
Designing an efficient yet deployment-friendly 3D backbone to handle sparse point clouds is
a fundamental problem in 3D perception. Compared with the customized sparse …
a fundamental problem in 3D perception. Compared with the customized sparse …
PV-RCNN++: Point-voxel feature set abstraction with local vector representation for 3D object detection
Abstract 3D object detection is receiving increasing attention from both industry and
academia thanks to its wide applications in various fields. In this paper, we propose Point …
academia thanks to its wide applications in various fields. In this paper, we propose Point …
Unitr: A unified and efficient multi-modal transformer for bird's-eye-view representation
Jointly processing information from multiple sensors is crucial to achieving accurate and
robust perception for reliable autonomous driving systems. However, current 3D perception …
robust perception for reliable autonomous driving systems. However, current 3D perception …
Cagroup3d: Class-aware grouping for 3d object detection on point clouds
We present a novel two-stage fully sparse convolutional 3D object detection framework,
named CAGroup3D. Our proposed method first generates some high-quality 3D proposals …
named CAGroup3D. Our proposed method first generates some high-quality 3D proposals …
GD-MAE: generative decoder for MAE pre-training on lidar point clouds
Despite the tremendous progress of Masked Autoencoders (MAE) in developing vision tasks
such as image and video, exploring MAE in large-scale 3D point clouds remains …
such as image and video, exploring MAE in large-scale 3D point clouds remains …
Nerf-rpn: A general framework for object detection in nerfs
This paper presents the first significant object detection framework, NeRF-RPN, which
directly operates on NeRF. Given a pre-trained NeRF model, NeRF-RPN aims to detect all …
directly operates on NeRF. Given a pre-trained NeRF model, NeRF-RPN aims to detect all …
Mask-attention-free transformer for 3d instance segmentation
Recently, transformer-based methods have dominated 3D instance segmentation, where
mask attention is commonly involved. Specifically, object queries are guided by the initial …
mask attention is commonly involved. Specifically, object queries are guided by the initial …
Uni3detr: Unified 3d detection transformer
Existing point cloud based 3D detectors are designed for the particular scene, either indoor
or outdoor ones. Because of the substantial differences in object distribution and point …
or outdoor ones. Because of the substantial differences in object distribution and point …