Sni-slam: Semantic neural implicit slam

S Zhu, G Wang, H Blum, J Liu, L Song… - Proceedings of the …, 2024 - openaccess.thecvf.com
We propose SNI-SLAM a semantic SLAM system utilizing neural implicit representation that
simultaneously performs accurate semantic mapping high-quality surface reconstruction and …

Deep learning based 3D segmentation: A survey

Y He, H Yu, X Liu, Z Yang, W Sun, S Anwar… - arXiv preprint arXiv …, 2021 - arxiv.org
3D segmentation is a fundamental and challenging problem in computer vision with
applications in autonomous driving and robotics. It has received significant attention from the …

Recent advances in multi-modal 3D scene understanding: A comprehensive survey and evaluation

Y Lei, Z Wang, F Chen, G Wang, P Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Multi-modal 3D scene understanding has gained considerable attention due to its wide
applications in many areas, such as autonomous driving and human-computer interaction …

Learning modality-agnostic representation for semantic segmentation from any modalities

X Zheng, Y Lyu, L Wang - European Conference on Computer Vision, 2025 - Springer
Image modality is not perfect as it often fails in certain conditions, eg, night and fast motion.
This significantly limits the robustness and versatility of existing multi-modal (ie, Image+ X) …

Centering the value of every modality: Towards efficient and resilient modality-agnostic semantic segmentation

X Zheng, Y Lyu, J Zhou, L Wang - European Conference on Computer …, 2025 - Springer
Fusing an arbitrary number of modalities is vital for achieving robust multi-modal fusion of
semantic segmentation yet remains less explored to date. Recent endeavors regard RGB …

4D-Former: Multimodal 4D panoptic segmentation

A Athar, E Li, S Casas… - Conference on Robot …, 2023 - proceedings.mlr.press
Abstract 4D panoptic segmentation is a challenging but practically useful task that requires
every point in a LiDAR point-cloud sequence to be assigned a semantic class label, and …

TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation

X Wu, Y Hou, X Huang, B Lin, T He… - Proceedings of the …, 2024 - openaccess.thecvf.com
Training deep models for LiDAR semantic segmentation is challenging due to the inherent
sparsity of point clouds. Utilizing temporal data is a natural remedy against the sparsity …

Uni-to-Multi Modal Knowledge Distillation for Bidirectional LiDAR-Camera Semantic Segmentation

T Sun, Z Zhang, X Tan, Y Peng, Y Qu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Combining LiDAR points and images for robust semantic segmentation has shown great
potential. However, the heterogeneity between the two modalities (eg the density, the field of …

Multimodal transformer for material segmentation

MK Reza, A Prater-Bennette, MS Asif - arXiv preprint arXiv:2309.04001, 2023 - arxiv.org
Leveraging information across diverse modalities is known to enhance performance on
multimodal segmentation tasks. However, effectively fusing information from different …

Practical self-driving cars: Survey of the state-of-the-art

D Saha, S De - 2022 - preprints.org
Abstract Self-Driving Vehicles or Autonomous Driving (AD) have emerged as the prime field
of research in Artificial Intelligence and Machine Learning of late. The indicated market …