Attention mechanisms in computer vision: A survey

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

Deep learning for 3d point clouds: A survey

Y Guo, H Wang, Q Hu, H Liu, L Liu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Point cloud learning has lately attracted increasing attention due to its wide applications in
many areas, such as computer vision, autonomous driving, and robotics. As a dominating …

Rethinking network design and local geometry in point cloud: A simple residual MLP framework

X Ma, C Qin, H You, H Ran, Y Fu - arXiv preprint arXiv:2202.07123, 2022 - arxiv.org
Point cloud analysis is challenging due to irregularity and unordered data structure. To
capture the 3D geometries, prior works mainly rely on exploring sophisticated local …

Revisiting point cloud classification: A new benchmark dataset and classification model on real-world data

MA Uy, QH Pham, BS Hua… - Proceedings of the …, 2019 - openaccess.thecvf.com
Deep learning techniques for point cloud data have demonstrated great potentials in solving
classical problems in 3D computer vision such as 3D object classification and segmentation …

Deep learning for lidar point clouds in autonomous driving: A review

Y Li, L Ma, Z Zhong, F Liu… - … on Neural Networks …, 2020 - ieeexplore.ieee.org
Recently, the advancement of deep learning (DL) in discriminative feature learning from 3-D
LiDAR data has led to rapid development in the field of autonomous driving. However …

Mvtn: Multi-view transformation network for 3d shape recognition

A Hamdi, S Giancola… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Multi-view projection methods have demonstrated their ability to reach state-of-the-art
performance on 3D shape recognition. Those methods learn different ways to aggregate …

Clip goes 3d: Leveraging prompt tuning for language grounded 3d recognition

D Hegde, JMJ Valanarasu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Vision-Language models like CLIP have been widely adopted for various tasks due to their
impressive zero-shot capabilities. However, CLIP is not suitable for extracting 3D geometric …

Meshnet: Mesh neural network for 3d shape representation

Y Feng, Y Feng, H You, X Zhao, Y Gao - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Mesh is an important and powerful type of data for 3D shapes and widely studied in the field
of computer vision and computer graphics. Regarding the task of 3D shape representation …

Pointdan: A multi-scale 3d domain adaption network for point cloud representation

C Qin, H You, L Wang, CCJ Kuo… - Advances in Neural …, 2019 - proceedings.neurips.cc
Abstract Domain Adaptation (DA) approaches achieved significant improvements in a wide
range of machine learning and computer vision tasks (ie, classification, detection, and …

Voxel-based three-view hybrid parallel network for 3D object classification

W Cai, D Liu, X Ning, C Wang, G Xie - Displays, 2021 - Elsevier
Three-dimensional models are widely used in the fields of multimedia, computer graphics,
virtual reality, entertainment, design, and manufacturing because of the rich information that …