Image segmentation using deep learning: A survey

S Minaee, Y Boykov, F Porikli, A Plaza… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Image segmentation is a key task in computer vision and image processing with important
applications such as scene understanding, medical image analysis, robotic perception …

A brief survey on semantic segmentation with deep learning

S Hao, Y Zhou, Y Guo - Neurocomputing, 2020 - Elsevier
Semantic segmentation is a challenging task in computer vision. In recent years, the
performance of semantic segmentation has been greatly improved by using deep learning …

Segment anything in 3d with nerfs

J Cen, Z Zhou, J Fang, W Shen, L Xie… - Advances in …, 2023 - proceedings.neurips.cc
Abstract Recently, the Segment Anything Model (SAM) emerged as a powerful vision
foundation model which is capable to segment anything in 2D images. This paper aims to …

Binsformer: Revisiting adaptive bins for monocular depth estimation

Z Li, X Wang, X Liu, J Jiang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn
increasing attention. Recently, some methods reformulate it as a classification-regression …

GMNet: Graded-feature multilabel-learning network for RGB-thermal urban scene semantic segmentation

W Zhou, J Liu, J Lei, L Yu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Semantic segmentation is a fundamental task in computer vision, and it has various
applications in fields such as robotic sensing, video surveillance, and autonomous driving. A …

Sparse fuse dense: Towards high quality 3d detection with depth completion

X Wu, L Peng, H Yang, L Xie… - Proceedings of the …, 2022 - openaccess.thecvf.com
Current LiDAR-only 3D detection methods inevitably suffer from the sparsity of point clouds.
Many multi-modal methods are proposed to alleviate this issue, while different …

CMX: Cross-modal fusion for RGB-X semantic segmentation with transformers

J Zhang, H Liu, K Yang, X Hu, R Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Scene understanding based on image segmentation is a crucial component of autonomous
vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting …

Visual transformers: Token-based image representation and processing for computer vision

B Wu, C Xu, X Dai, A Wan, P Zhang, Z Yan… - arXiv preprint arXiv …, 2020 - arxiv.org
Computer vision has achieved remarkable success by (a) representing images as uniformly-
arranged pixel arrays and (b) convolving highly-localized features. However, convolutions …

Bi-directional cross-modality feature propagation with separation-and-aggregation gate for RGB-D semantic segmentation

X Chen, KY Lin, J Wang, W Wu, C Qian, H Li… - European conference on …, 2020 - Springer
Depth information has proven to be a useful cue in the semantic segmentation of RGB-D
images for providing a geometric counterpart to the RGB representation. Most existing works …

Deep learning for image super-resolution: A survey

Z Wang, J Chen, SCH Hoi - IEEE transactions on pattern …, 2020 - ieeexplore.ieee.org
Image Super-Resolution (SR) is an important class of image processing techniqueso
enhance the resolution of images and videos in computer vision. Recent years have …