Voxnet: A 3d convolutional neural network for real-time object recognition

Z Li, F Liu, W Yang, S Peng… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

A convolutional neural network (CNN) is one of the most significant networks in the deep
learning field. Since CNN made impressive achievements in many areas, including but not …

被引用次数：2890 相关文章所有 7 个版本

Real-time monitoring of construction sites: Sensors, methods, and applications

AS Rao, M Radanovic, Y Liu, S Hu, Y Fang… - Automation in …, 2022 - Elsevier

The construction industry is one of the world's largest industries, with an annual budget of
$10 trillion globally. Despite its size, the efficiency and growth in labour productivity in the …

被引用次数：159 相关文章所有 3 个版本

[PDF] neurips.cc

Point transformer v2: Grouped vector attention and partition-based pooling

X Wu, Y Lao, L Jiang, X Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

As a pioneering work exploring transformer architecture for 3D point cloud understanding,
Point Transformer achieves impressive results on multiple highly competitive benchmarks. In …

被引用次数：238 相关文章所有 8 个版本

[PDF] thecvf.com

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

L Xue, M Gao, C Xing, R Martín-Martín… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recognition capabilities of current state-of-the-art 3D models are limited by datasets with
a small number of annotated data and a pre-defined set of categories. In its 2D counterpart …

被引用次数：168 相关文章所有 6 个版本

[PDF] thecvf.com

Pointclip: Point cloud understanding by clip

R Zhang, Z Guo, W Zhang, K Li… - Proceedings of the …, 2022 - openaccess.thecvf.com

Recently, zero-shot and few-shot learning via Contrastive Vision-Language Pre-training
(CLIP) have shown inspirational performance on 2D visual recognition, which learns to …

被引用次数：351 相关文章所有 5 个版本

[PDF] arxiv.org

Rethinking network design and local geometry in point cloud: A simple residual MLP framework

X Ma, C Qin, H You, H Ran, Y Fu - arXiv preprint arXiv:2202.07123, 2022 - arxiv.org

Point cloud analysis is challenging due to irregularity and unordered data structure. To
capture the 3D geometries, prior works mainly rely on exploring sophisticated local …

被引用次数：550 相关文章所有 3 个版本

[PDF] thecvf.com

Softgroup for 3d instance segmentation on point clouds

T Vu, K Kim, TM Luu, T Nguyen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Existing state-of-the-art 3D instance segmentation methods perform semantic segmentation
followed by grouping. The hard predictions are made when performing semantic …

被引用次数：228 相关文章所有 7 个版本

[PDF] neurips.cc

Deep marching tetrahedra: a hybrid representation for high-resolution 3d shape synthesis

T Shen, J Gao, K Yin, MY Liu… - Advances in Neural …, 2021 - proceedings.neurips.cc

We introduce DMTet, a deep 3D conditional generative model that can synthesize high-
resolution 3D shapes using simple user guides such as coarse voxels. It marries the merits …

被引用次数：345 相关文章所有 9 个版本

[PDF] thecvf.com

An end-to-end transformer model for 3d object detection

I Misra, R Girdhar, A Joulin - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

We propose 3DETR, an end-to-end Transformer based object detection model for 3D point
clouds. Compared to existing detection methods that employ a number of 3D-specific …

被引用次数：465 相关文章所有 7 个版本

[PDF] thecvf.com

Point Transformer V3: Simpler Faster Stronger

X Wu, L Jiang, PS Wang, Z Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …

被引用次数：53 相关文章所有 4 个版本