Gated fully fusion for semantic segmentation

J Janai, F Güney, A Behl, A Geiger - Foundations and Trends® …, 2020 - nowpublishers.com

Recent years have witnessed enormous progress in AI-related fields such as computer
vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it …

被引用次数：1139 相关文章所有 9 个版本

[PDF] hal.science

Deep multimodal fusion for semantic image segmentation: A survey

Y Zhang, D Sidibé, O Morel, F Mériaudeau - Image and Vision Computing, 2021 - Elsevier

Recent advances in deep learning have shown excellent performance in various scene
understanding tasks. However, in some complex environments or under challenging …

被引用次数：205 相关文章所有 5 个版本

[PDF] neurips.cc

Segnext: Rethinking convolutional attention design for semantic segmentation

MH Guo, CZ Lu, Q Hou, Z Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …

被引用次数：657 相关文章所有 6 个版本

[PDF] ieee.org

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

被引用次数：107 相关文章所有 3 个版本

[PDF] neurips.cc

Segvit: Semantic segmentation with plain vision transformers

B Zhang, Z Tian, Q Tang, X Chu… - Advances in Neural …, 2022 - proceedings.neurips.cc

We explore the capability of plain Vision Transformers (ViTs) for semantic segmentation and
propose the SegViT. Previous ViT-based segmentation networks usually learn a pixel-level …

被引用次数：124 相关文章所有 6 个版本

[PDF] thecvf.com

Rethinking semantic segmentation from a sequence-to-sequence perspective with transformers

S Zheng, J Lu, H Zhao, X Zhu, Z Luo… - Proceedings of the …, 2021 - openaccess.thecvf.com

Most recent semantic segmentation methods adopt a fully-convolutional network (FCN) with
an encoder-decoder architecture. The encoder progressively reduces the spatial resolution …

被引用次数：3647 相关文章所有 10 个版本

[PDF] ieee.org

Towards open vocabulary learning: A survey

J Wu, X Li, S Xu, H Yuan, H Ding… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

In the field of visual scene understanding, deep neural networks have made impressive
advancements in various core tasks like segmentation, tracking, and detection. However …

被引用次数：116 相关文章所有 10 个版本

[PDF] thecvf.com

Rpvnet: A deep and efficient range-point-voxel fusion network for lidar point cloud segmentation

J Xu, R Zhang, J Dou, Y Zhu… - Proceedings of the …, 2021 - openaccess.thecvf.com

Point clouds can be represented in many forms (views), typically, point-based sets, voxel-
based cells or range-based images (ie, panoramic view). The point-based view is …

被引用次数：311 相关文章所有 7 个版本

[PDF] arxiv.org

Axial-deeplab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer

Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

被引用次数：872 相关文章所有 9 个版本

[PDF] thecvf.com

Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com

In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

被引用次数：725 相关文章所有 8 个版本