Attention mechanisms in computer vision: A survey
Humans can naturally and effectively find salient regions in complex scenes. Motivated by
this observation, attention mechanisms were introduced into computer vision with the aim of …
this observation, attention mechanisms were introduced into computer vision with the aim of …
A review on the attention mechanism of deep learning
Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
Lisa: Reasoning segmentation via large language model
Although perception systems have made remarkable advancements in recent years they still
rely on explicit human instruction or pre-defined categories to identify the target objects …
rely on explicit human instruction or pre-defined categories to identify the target objects …
Rethinking semantic segmentation: A prototype view
Prevalent semantic segmentation solutions, despite their different network designs (FCN
based or attention based) and mask decoding strategies (parametric softmax based or pixel …
based or attention based) and mask decoding strategies (parametric softmax based or pixel …
SegFormer: Simple and efficient design for semantic segmentation with transformers
We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …
Segmenter: Transformer for semantic segmentation
Image segmentation is often ambiguous at the level of individual image patches and
requires contextual information to reach label consensus. In this paper we introduce …
requires contextual information to reach label consensus. In this paper we introduce …
Deep hierarchical semantic segmentation
Humans are able to recognize structured relations in observation, allowing us to decompose
complex scenes into simpler parts and abstract the visual world in multiple levels. However …
complex scenes into simpler parts and abstract the visual world in multiple levels. However …
Hrda: Context-aware high-resolution domain-adaptive semantic segmentation
Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source
domain (eg synthetic data) to the target domain (eg real-world data) without requiring further …
domain (eg synthetic data) to the target domain (eg real-world data) without requiring further …
Multi-stage progressive image restoration
Image restoration tasks demand a complex balance between spatial details and high-level
contextualized information while recovering images. In this paper, we propose a novel …
contextualized information while recovering images. In this paper, we propose a novel …
Deep dual-resolution networks for real-time and accurate semantic segmentation of traffic scenes
Using light-weight architectures or reasoning on low-resolution images, recent methods
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …
realize very fast scene parsing, even running at more than 100 FPS on a single GPU …