Vision transformer adapter for dense predictions
This work investigates a simple yet powerful adapter for Vision Transformer (ViT). Unlike
recent visual transformers that introduce vision-specific inductive biases into their …
recent visual transformers that introduce vision-specific inductive biases into their …
Oneformer: One transformer to rule universal image segmentation
Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …
segmentation include scene parsing, panoptic segmentation, and, more recently, new …
Hornet: Efficient high-order spatial interactions with recursive gated convolutions
Recent progress in vision Transformers exhibits great success in various tasks driven by the
new spatial modeling mechanism based on dot-product self-attention. In this paper, we …
new spatial modeling mechanism based on dot-product self-attention. In this paper, we …
Masked-attention mask transformer for universal image segmentation
Image segmentation groups pixels with different semantics, eg, category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …
membership. Each choice of semantics defines a task. While only the semantics of each task …
Pp-liteseg: A superior real-time semantic segmentation model
J Peng, Y Liu, S Tang, Y Hao, L Chu, G Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
Real-world applications have high demands for semantic segmentation methods. Although
semantic segmentation has made remarkable leap-forwards with deep learning, the …
semantic segmentation has made remarkable leap-forwards with deep learning, the …
Hydra: A real-time spatial perception system for 3D scene graph construction and optimization
3D scene graphs have recently emerged as a powerful high-level representation of 3D
environments. A 3D scene graph describes the environment as a layered graph where …
environments. A 3D scene graph describes the environment as a layered graph where …
A comprehensive review of modern object segmentation approaches
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …
class labels. It has a wide range of applications in many industries including healthcare …
Changer: Feature interaction is what you need for change detection
S Fang, K Li, Z Li - IEEE Transactions on Geoscience and …, 2023 - ieeexplore.ieee.org
Change detection is an important tool for long-term Earth observation missions. It takes bi-
temporal images as input and predicts “where” the change has occurred. Different from other …
temporal images as input and predicts “where” the change has occurred. Different from other …
Semask: Semantically masked transformers for semantic segmentation
Finetuning a pretrained backbone in the encoder part of an image transformer network has
been the traditional approach for the semantic segmentation task. However, such an …
been the traditional approach for the semantic segmentation task. However, such an …
Tripartite feature enhanced pyramid network for dense prediction
Learning pyramidal feature representations is important for many dense prediction tasks (eg,
object detection, semantic segmentation) that demand multi-scale visual understanding …
object detection, semantic segmentation) that demand multi-scale visual understanding …