A survey of convolutional neural networks: analysis, applications, and prospects

Z Li, F Liu, W Yang, S Peng… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
A convolutional neural network (CNN) is one of the most significant networks in the deep
learning field. Since CNN made impressive achievements in many areas, including but not …

Real-time monitoring of construction sites: Sensors, methods, and applications

AS Rao, M Radanovic, Y Liu, S Hu, Y Fang… - Automation in …, 2022 - Elsevier
The construction industry is one of the world's largest industries, with an annual budget of
$10 trillion globally. Despite its size, the efficiency and growth in labour productivity in the …

Point transformer v2: Grouped vector attention and partition-based pooling

X Wu, Y Lao, L Jiang, X Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
As a pioneering work exploring transformer architecture for 3D point cloud understanding,
Point Transformer achieves impressive results on multiple highly competitive benchmarks. In …

Ulip: Learning a unified representation of language, images, and point clouds for 3d understanding

L Xue, M Gao, C Xing, R Martín-Martín… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recognition capabilities of current state-of-the-art 3D models are limited by datasets with
a small number of annotated data and a pre-defined set of categories. In its 2D counterpart …

Pointclip: Point cloud understanding by clip

R Zhang, Z Guo, W Zhang, K Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, zero-shot and few-shot learning via Contrastive Vision-Language Pre-training
(CLIP) have shown inspirational performance on 2D visual recognition, which learns to …

Rethinking network design and local geometry in point cloud: A simple residual MLP framework

X Ma, C Qin, H You, H Ran, Y Fu - arXiv preprint arXiv:2202.07123, 2022 - arxiv.org
Point cloud analysis is challenging due to irregularity and unordered data structure. To
capture the 3D geometries, prior works mainly rely on exploring sophisticated local …

Softgroup for 3d instance segmentation on point clouds

T Vu, K Kim, TM Luu, T Nguyen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Existing state-of-the-art 3D instance segmentation methods perform semantic segmentation
followed by grouping. The hard predictions are made when performing semantic …

Deep marching tetrahedra: a hybrid representation for high-resolution 3d shape synthesis

T Shen, J Gao, K Yin, MY Liu… - Advances in Neural …, 2021 - proceedings.neurips.cc
We introduce DMTet, a deep 3D conditional generative model that can synthesize high-
resolution 3D shapes using simple user guides such as coarse voxels. It marries the merits …

An end-to-end transformer model for 3d object detection

I Misra, R Girdhar, A Joulin - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
We propose 3DETR, an end-to-end Transformer based object detection model for 3D point
clouds. Compared to existing detection methods that employ a number of 3D-specific …

Point Transformer V3: Simpler Faster Stronger

X Wu, L Jiang, PS Wang, Z Liu, X Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper is not motivated to seek innovation within the attention mechanism. Instead it
focuses on overcoming the existing trade-offs between accuracy and efficiency within the …