PSANet: Point-wise spatial attention network for scene parsing

MH Guo, TX Xu, JJ Liu, ZN Liu, PT Jiang, TJ Mu… - Computational visual …, 2022 - Springer

Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …

Cited by 1281 · 10 versions

A review on the attention mechanism of deep learning

Z Niu, G Zhong, H Yu - Neurocomputing, 2021 - Elsevier

Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …

Cited by 1568 · 4 versions

LISA: Reasoning segmentation via large language model

X Lai, Z Tian, Y Chen, Y Li, Y Yuan… - Proceedings of the …, 2024 - openaccess.thecvf.com

Although perception systems have made remarkable advancements in recent years, they still
rely on explicit human instruction or pre-defined categories to identify the target objects …

Cited by 151 · 2 versions

Rethinking semantic segmentation: A prototype view

T Zhou, W Wang, E Konukoglu… - Proceedings of the …, 2022 - openaccess.thecvf.com

Prevalent semantic segmentation solutions, despite their different network designs (FCN-based
or attention-based) and mask decoding strategies (parametric softmax-based or pixel …

Cited by 240 · 11 versions

SegFormer: Simple and efficient design for semantic segmentation with transformers

E Xie, W Wang, Z Yu, A Anandkumar… - Advances in neural …, 2021 - proceedings.neurips.cc

We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …

Cited by 3516 · 10 versions

Segmenter: Transformer for semantic segmentation

R Strudel, R Garcia, I Laptev… - Proceedings of the …, 2021 - openaccess.thecvf.com

Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …

Cited by 1500 · 13 versions

Deep hierarchical semantic segmentation

L Li, T Zhou, W Wang, J Li… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world in multiple levels. However …

Cited by 126 · 9 versions

HRDA: Context-aware high-resolution domain-adaptive semantic segmentation

L Hoyer, D Dai, L Van Gool - European conference on computer vision, 2022 - Springer

Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source
domain (e.g., synthetic data) to the target domain (e.g., real-world data) without requiring further …

Cited by 166 · 11 versions

Multi-stage progressive image restoration

SW Zamir, A Arora, S Khan, M Hayat… - Proceedings of the …, 2021 - openaccess.thecvf.com

Image restoration tasks demand a complex balance between spatial details and high-level
contextualized information while recovering images. In this paper, we propose a novel …

Cited by 1400 · 12 versions

Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes

H Pan, Y Hong, W Sun, Y Jia - IEEE Transactions on Intelligent …, 2022 - ieeexplore.ieee.org

Using light-weight architectures or reasoning on low-resolution images, recent methods
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …

Cited by 123 · 3 versions