Vision transformer adapter for dense predictions

Z Chen, Y Duan, W Wang, J He, T Lu, J Dai… - arXiv preprint arXiv …, 2022 - arxiv.org
This work investigates a simple yet powerful adapter for Vision Transformer (ViT). Unlike
recent visual transformers that introduce vision-specific inductive biases into their …

Oneformer: One transformer to rule universal image segmentation

J Jain, J Li, MT Chiu, A Hassani… - Proceedings of the …, 2023 - openaccess.thecvf.com
Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …

Hornet: Efficient high-order spatial interactions with recursive gated convolutions

Y Rao, W Zhao, Y Tang, J Zhou… - Advances in Neural …, 2022 - proceedings.neurips.cc
Recent progress in vision Transformers exhibits great success in various tasks driven by the
new spatial modeling mechanism based on dot-product self-attention. In this paper, we …

Masked-attention mask transformer for universal image segmentation

B Cheng, I Misra, AG Schwing… - Proceedings of the …, 2022 - openaccess.thecvf.com
Image segmentation groups pixels with different semantics, eg, category or instance
membership. Each choice of semantics defines a task. While only the semantics of each task …

Pp-liteseg: A superior real-time semantic segmentation model

J Peng, Y Liu, S Tang, Y Hao, L Chu, G Chen… - arXiv preprint arXiv …, 2022 - arxiv.org
Real-world applications have high demands for semantic segmentation methods. Although
semantic segmentation has made remarkable leap-forwards with deep learning, the …

Hydra: A real-time spatial perception system for 3D scene graph construction and optimization

N Hughes, Y Chang, L Carlone - arXiv preprint arXiv:2201.13360, 2022 - arxiv.org
3D scene graphs have recently emerged as a powerful high-level representation of 3D
environments. A 3D scene graph describes the environment as a layered graph where …

A comprehensive review of modern object segmentation approaches

Y Wang, U Ahsan, H Li, M Hagen - Foundations and Trends® …, 2022 - nowpublishers.com
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …

Changer: Feature interaction is what you need for change detection

S Fang, K Li, Z Li - IEEE Transactions on Geoscience and …, 2023 - ieeexplore.ieee.org
Change detection is an important tool for long-term Earth observation missions. It takes bi-
temporal images as input and predicts “where” the change has occurred. Different from other …

Semask: Semantically masked transformers for semantic segmentation

J Jain, A Singh, N Orlov, Z Huang, J Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
Finetuning a pretrained backbone in the encoder part of an image transformer network has
been the traditional approach for the semantic segmentation task. However, such an …

Tripartite feature enhanced pyramid network for dense prediction

D Liu, J Liang, T Geng, A Loui… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Learning pyramidal feature representations is important for many dense prediction tasks (eg,
object detection, semantic segmentation) that demand multi-scale visual understanding …