Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation

H Zhu, F Meng, J Cai, S Lu - Journal of Visual Communication and Image …, 2016 - Elsevier
Image segmentation refers to the process to divide an image into meaningful non-
overlapping regions according to human perception, which has become a classic topic since …

A review of co-saliency detection algorithms: Fundamentals, applications, and challenges

D Zhang, H Fu, J Han, A Borji, X Li - ACM Transactions on Intelligent …, 2018 - dl.acm.org
Co-saliency detection is a newly emerging and rapidly growing research area in the
computer vision community. As a novel branch of visual saliency, co-saliency detection …

A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

[PDF][PDF] Deep vit features as dense visual descriptors

S Amir, Y Gandelsman, S Bagon… - arXiv preprint arXiv …, 2021 - dino-vit-features.github.io
We study the use of deep features extracted from a pretrained Vision Transformer (ViT) as
dense visual descriptors. We observe and empirically demonstrate that such features, when …

Revisiting unreasonable effectiveness of data in deep learning era

C Sun, A Shrivastava, S Singh… - Proceedings of the …, 2017 - openaccess.thecvf.com
The success of deep learning in vision can be attributed to:(a) models with high capacity;(b)
increased computational power; and (c) availability of large-scale labeled data. Since 2012 …

Segmenting objects from relational visual data

X Lu, W Wang, J Shen, DJ Crandall… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
In this article, we model a set of pixelwise object segmentation tasks—automatic video
segmentation (AVS), image co-segmentation (ICS) and few-shot semantic segmentation …

Zero-shot video object segmentation via attentive graph neural networks

W Wang, X Lu, J Shen… - Proceedings of the …, 2019 - openaccess.thecvf.com
This work proposes a novel attentive graph neural network (AGNN) for zero-shot video
object segmentation (ZVOS). The suggested AGNN recasts this task as a process of iterative …

Learn to pay attention

S Jetley, NA Lord, N Lee, PHS Torr - arXiv preprint arXiv:1804.02391, 2018 - arxiv.org
We propose an end-to-end-trainable attention module for convolutional neural network
(CNN) architectures built for image classification. The module takes as input the 2D feature …

Convolutional neural network architecture for geometric matching

I Rocco, R Arandjelovic, J Sivic - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
We address the problem of determining correspondences between two images in
agreement with a geometric model such as an affine or thin-plate spline transformation, and …

Crnet: Cross-reference networks for few-shot segmentation

W Liu, C Zhang, G Lin, F Liu - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
Over the past few years, state-of-the-art image segmentation algorithms are based on deep
convolutional neural networks. To render a deep network with the ability to understand a …