Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional …

Robustness-aware 3d object detection in autonomous driving: A review and outlook

Z Song, L Liu, F Jia, Y Luo, C Jia… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
In the realm of modern autonomous driving, the perception system is indispensable for
accurately assessing the state of the surrounding environment, thereby enabling informed …

Dsvt: Dynamic sparse voxel transformer with rotated sets

H Wang, C Shi, S Shi, M Lei, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Designing an efficient yet deployment-friendly 3D backbone to handle sparse point clouds is
a fundamental problem in 3D perception. Compared with the customized sparse …

PV-RCNN++: Point-voxel feature set abstraction with local vector representation for 3D object detection

S Shi, L Jiang, J Deng, Z Wang, C Guo, J Shi… - International Journal of …, 2023 - Springer
Abstract 3D object detection is receiving increasing attention from both industry and
academia thanks to its wide applications in various fields. In this paper, we propose Point …

Unitr: A unified and efficient multi-modal transformer for bird's-eye-view representation

H Wang, H Tang, S Shi, A Li, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Jointly processing information from multiple sensors is crucial to achieving accurate and
robust perception for reliable autonomous driving systems. However, current 3D perception …

Cagroup3d: Class-aware grouping for 3d object detection on point clouds

H Wang, L Ding, S Dong, S Shi, A Li… - Advances in Neural …, 2022 - proceedings.neurips.cc
We present a novel two-stage fully sparse convolutional 3D object detection framework,
named CAGroup3D. Our proposed method first generates some high-quality 3D proposals …

GD-MAE: generative decoder for MAE pre-training on lidar point clouds

H Yang, T He, J Liu, H Chen, B Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Despite the tremendous progress of Masked Autoencoders (MAE) in developing vision tasks
such as image and video, exploring MAE in large-scale 3D point clouds remains …

Nerf-rpn: A general framework for object detection in nerfs

B Hu, J Huang, Y Liu, YW Tai… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper presents the first significant object detection framework, NeRF-RPN, which
directly operates on NeRF. Given a pre-trained NeRF model, NeRF-RPN aims to detect all …

Mask-attention-free transformer for 3d instance segmentation

X Lai, Y Yuan, R Chu, Y Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, transformer-based methods have dominated 3D instance segmentation, where
mask attention is commonly involved. Specifically, object queries are guided by the initial …

Uni3detr: Unified 3d detection transformer

Z Wang, YL Li, X Chen, H Zhao… - Advances in Neural …, 2024 - proceedings.neurips.cc
Existing point cloud based 3D detectors are designed for the particular scene, either indoor
or outdoor ones. Because of the substantial differences in object distribution and point …