Depth-aware cnn for rgb-d segmentation

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

被引用次数：3970 相关文章所有 13 个版本

A brief survey on semantic segmentation with deep learning

S Hao, Y Zhou, Y Guo - Neurocomputing, 2020 - Elsevier

Semantic segmentation is a challenging task in computer vision. In recent years, the
performance of semantic segmentation has been greatly improved by using deep learning …

被引用次数：550 相关文章所有 2 个版本

Segment anything in 3d with nerfs

J Cen, Z Zhou, J Fang, W Shen, L Xie… - Advances in …, 2023 - proceedings.neurips.cc

Abstract Recently, the Segment Anything Model (SAM) emerged as a powerful vision
foundation model which is capable to segment anything in 2D images. This paper aims to …

被引用次数：134 相关文章所有 4 个版本

[PDF] arxiv.org

Binsformer: Revisiting adaptive bins for monocular depth estimation

Z Li, X Wang, X Liu, J Jiang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org

Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn
increasing attention. Recently, some methods reformulate it as a classification-regression …

被引用次数：187 相关文章所有 2 个版本

GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation

W Zhou, J Liu, J Lei, L Yu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Semantic segmentation is a fundamental task in computer vision, and it has various
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …

被引用次数：246 相关文章所有 5 个版本

[PDF] thecvf.com

Sparse fuse dense: Towards high quality 3d detection with depth completion

X Wu, L Peng, H Yang, L Xie… - Proceedings of the …, 2022 - openaccess.thecvf.com

Current LiDAR-only 3D detection methods inevitably suffer from the sparsity of point clouds.
Many multi-modal methods are proposed to alleviate this issue, while different …

被引用次数：220 相关文章所有 6 个版本

[PDF] arxiv.org

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

被引用次数：233 相关文章所有 7 个版本

[PDF] arxiv.org

Visual transformers: Token-based image representation and processing for computer vision

B Wu, C Xu, X Dai, A Wan, P Zhang, Z Yan… - arXiv preprint arXiv …, 2020 - arxiv.org

Computer vision has achieved remarkable success by (a) representing images as uniformly-
arranged pixel arrays and (b) convolving highly-localized features. However, convolutions …

被引用次数：625 相关文章所有 3 个版本

[PDF] arxiv.org

Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation

X Chen, KY Lin, J Wang, W Wu, C Qian, H Li… - European conference on …, 2020 - Springer

Depth information has proven to be a useful cue in the semantic segmentation of RGB-D
images for providing a geometric counterpart to the RGB representation. Most existing works …

被引用次数：378 相关文章所有 6 个版本

[PDF] arxiv.org

Deep learning for image super-resolution: A survey

Z Wang, J Chen, SCH Hoi - IEEE transactions on pattern …, 2020 - ieeexplore.ieee.org

Image Super-Resolution (SR) is an important class of image processing techniqueso
enhance the resolution of images and videos in computer vision. Recent years have …

被引用次数：1886 相关文章所有 10 个版本