Multi-modal data-efficient 3d scene understanding for autonomous driving

L Kong, X Xu, J Ren, W Zhang, L Pan, K Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Efficient data utilization is crucial for advancing 3D scene understanding in autonomous
driving, where reliance on heavily human-annotated LiDAR point clouds challenges fully …

Multi-Space Alignments Towards Universal LiDAR Segmentation

Y Liu, L Kong, X Wu, R Chen, X Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
A unified and versatile LiDAR segmentation model with strong robustness and
generalizability is desirable for safe autonomous driving perception. This work presents …

Unified 3d and 4d panoptic segmentation via dynamic shifting networks

F Hong, L Kong, H Zhou, X Zhu, H Li… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
With the rapid advances in autonomous driving, it becomes critical to equip its sensing
system with more holistic 3D perception. However, widely explored tasks like 3D detection …

4d contrastive superflows are dense 3d representation learners

X Xu, L Kong, H Shuai, W Zhang, L Pan, K Chen… - … on Computer Vision, 2025 - Springer
In the realm of autonomous driving, accurate 3D perception is the foundation. However,
developing such models relies on extensive human annotations–a process that is both …

Calib3d: Calibrating model preferences for reliable 3d scene understanding

L Kong, X Xu, J Cen, W Zhang, L Pan, K Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Safety-critical 3D scene understanding tasks necessitate not only accurate but also
confident predictions from 3D perception models. This study introduces Calib3D, a …

Sfpnet: Sparse focal point network for semantic segmentation on general lidar point clouds

Y Wang, W Zhao, C Cao, T Deng, J Wang… - European Conference on …, 2025 - Springer
Although LiDAR semantic segmentation advances rapidly, state-of-the-art methods often
incorporate specifically designed inductive bias derived from benchmarks originating from …

Is Your LiDAR Placement Optimized for 3D Scene Understanding?

Y Li, L Kong, H Hu, X Xu, X Huang - The Thirty-eighth Annual …, 2024 - openreview.net
The reliability of driving perception systems under unprecedented conditions is crucial for
practical usage. Latest advancements have prompted increasing interest in multi-LiDAR …

An empirical study of training state-of-the-art LiDAR segmentation models

J Sun, C Qing, X Xu, L Kong, Y Liu, L Li, C Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
In the rapidly evolving field of autonomous driving, precise segmentation of LiDAR data is
crucial for understanding complex 3D environments. Traditional approaches often rely on …

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives

S Luo, W Chen, W Tian, R Liu, L Hou… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Foundation models have indeed made a profound impact on various fields, emerging as
pivotal components that significantly shape the capabilities of intelligent systems. In the …

Optimizing LiDAR Placements for Robust Driving Perception in Adverse Conditions

Y Li, L Kong, H Hu, X Xu, X Huang - arXiv preprint arXiv:2403.17009, 2024 - arxiv.org
The robustness of driving perception systems under unprecedented conditions is crucial for
safety-critical usages. Latest advancements have prompted increasing interests towards …