3DCTN: 3D convolution-transformer network for point cloud classification

D Lu, Q Xie, K Gao, L Xu, J Li - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org
IEEE Transactions on Intelligent Transportation Systems, 2022ieeexplore.ieee.org
Point cloud classification is a fundamental task in 3D applications. However, it is challenging
to achieve effective feature learning due to the irregularity and unordered nature of point
clouds. Lately, 3D Transformers have been adopted to improve point cloud processing.
Nevertheless, massive Transformer layers tend to incur huge computational and memory
costs. This paper presented a novel hierarchical framework that incorporated convolutions
with Transformers for point cloud classification, named 3D Convolution-Transformer Network …
Point cloud classification is a fundamental task in 3D applications. However, it is challenging to achieve effective feature learning due to the irregularity and unordered nature of point clouds. Lately, 3D Transformers have been adopted to improve point cloud processing. Nevertheless, massive Transformer layers tend to incur huge computational and memory costs. This paper presented a novel hierarchical framework that incorporated convolutions with Transformers for point cloud classification, named 3D Convolution-Transformer Network (3DCTN). It combined the strong local feature learning ability of convolutions with the remarkable global context modeling capability of Transformers. Our method had two main modules operating on the downsampling point sets. Each module consisted of a multi-scale local feature aggregating (LFA) block and a global feature learning (GFL) block, which were implemented by using the Graph Convolution and Transformer respectively. We also conducted a detailed investigation on a series of self-attention variants to explore better performance for our network. Various experiments on ModelNet40 and ScanObjectNN datasets demonstrated that our method achieves state-of-the-art classification performance with a lightweight design. The code is publicly available at https://github.com/d62lu/3DCTN .
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果