Self-supervised pretraining of 3d features on any point-cloud

J Mao, S Shi, X Wang, H Li - International Journal of Computer Vision, 2023 - Springer

Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving …

被引用次数：194 相关文章所有 8 个版本

[PDF] arxiv.org

Unsupervised point cloud representation learning with deep neural networks: A survey

A Xiao, J Huang, D Guan, X Zhang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Point cloud data have been widely explored due to its superior accuracy and robustness
under various adverse situations. Meanwhile, deep neural networks (DNNs) have achieved …

被引用次数：118 相关文章所有 10 个版本

[PDF] arxiv.org

Masked autoencoders for point cloud self-supervised learning

Y Pang, W Wang, FEH Tay, W Liu, Y Tian… - European conference on …, 2022 - Springer

As a promising scheme of self-supervised learning, masked autoencoding has significantly
advanced natural language processing and computer vision. Inspired by this, we propose a …

被引用次数：502 相关文章所有 6 个版本

[PDF] thecvf.com

Clip2scene: Towards label-efficient 3d scene understanding by clip

R Chen, Y Liu, L Kong, X Zhu, Y Ma… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Contrastive Language-Image Pre-training (CLIP) achieves promising results in 2D
zero-shot and few-shot learning. Despite the impressive performance in 2D, applying CLIP …

被引用次数：141 相关文章所有 6 个版本

[PDF] thecvf.com

Point-bert: Pre-training 3d point cloud transformers with masked point modeling

X Yu, L Tang, Y Rao, T Huang… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present Point-BERT, a novel paradigm for learning Transformers to generalize the
concept of BERT onto 3D point cloud. Following BERT, we devise a Masked Point Modeling …

被引用次数：695 相关文章所有 6 个版本

[PDF] neurips.cc

Point-m2ae: multi-scale masked autoencoders for hierarchical point cloud pre-training

R Zhang, Z Guo, P Gao, R Fang… - Advances in neural …, 2022 - proceedings.neurips.cc

Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for
language and 2D image transformers. However, it still remains an open question on how to …

被引用次数：253 相关文章所有 6 个版本

[PDF] thecvf.com

Learning 3d representations from 2d pre-trained models via image-to-point masked autoencoders

R Zhang, L Wang, Y Qiao, P Gao… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Pre-training by numerous image data has become de-facto for robust 2D representations. In
contrast, due to the expensive data processing, a paucity of 3D datasets severely hinders …

被引用次数：137 相关文章所有 5 个版本

[PDF] thecvf.com

Crosspoint: Self-supervised cross-modal contrastive learning for 3d point cloud understanding

M Afham, I Dissanayake… - Proceedings of the …, 2022 - openaccess.thecvf.com

Manual annotation of large-scale point cloud dataset for varying tasks such as 3D object
classification, segmentation and detection is often laborious owing to the irregular structure …

被引用次数：300 相关文章所有 7 个版本

[PDF] arxiv.org

Rethinking network design and local geometry in point cloud: A simple residual MLP framework

X Ma, C Qin, H You, H Ran, Y Fu - arXiv preprint arXiv:2202.07123, 2022 - arxiv.org

Point cloud analysis is challenging due to irregularity and unordered data structure. To
capture the 3D geometries, prior works mainly rely on exploring sophisticated local …

被引用次数：671 相关文章所有 3 个版本

[PDF] thecvf.com

An end-to-end transformer model for 3d object detection

I Misra, R Girdhar, A Joulin - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point
clouds. Compared to existing detection methods that employ a number of 3D-specific …

被引用次数：528 相关文章所有 7 个版本