Unsupervised joint object discovery and segmentation in internet images

Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation

H Zhu, F Meng, J Cai, S Lu - Journal of Visual Communication and Image …, 2016 - Elsevier

Image segmentation refers to the process to divide an image into meaningful non-
overlapping regions according to human perception, which has become a classic topic since …

被引用次数：303 相关文章所有 6 个版本

A review of co-saliency detection algorithms: Fundamentals, applications, and challenges

D Zhang, H Fu, J Han, A Borji, X Li - ACM Transactions on Intelligent …, 2018 - dl.acm.org

Co-saliency detection is a newly emerging and rapidly growing research area in the
computer vision community. As a novel branch of visual saliency, co-saliency detection …

被引用次数：133 相关文章所有 3 个版本

[PDF] neurips.cc

A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence

J Zhang, C Herrmann, J Hur… - Advances in …, 2024 - proceedings.neurips.cc

Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …

被引用次数：90 相关文章所有 8 个版本

[PDF] github.io

[PDF][PDF] Deep vit features as dense visual descriptors

S Amir, Y Gandelsman, S Bagon… - arXiv preprint arXiv …, 2021 - dino-vit-features.github.io

We study the use of deep features extracted from a pretrained Vision Transformer (ViT) as
dense visual descriptors. We observe and empirically demonstrate that such features, when …

被引用次数：247 相关文章所有 3 个版本

[PDF] thecvf.com

Revisiting unreasonable effectiveness of data in deep learning era

C Sun, A Shrivastava, S Singh… - Proceedings of the …, 2017 - openaccess.thecvf.com

The success of deep learning in vision can be attributed to:(a) models with high capacity;(b)
increased computational power; and (c) availability of large-scale labeled data. Since 2012 …

被引用次数：3062 相关文章所有 12 个版本

[PDF] ethz.ch

Segmenting objects from relational visual data

X Lu, W Wang, J Shen, DJ Crandall… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

In this article, we model a set of pixelwise object segmentation tasks—automatic video
segmentation (AVS), image co-segmentation (ICS) and few-shot semantic segmentation …

被引用次数：128 相关文章所有 8 个版本

[PDF] thecvf.com

Zero-shot video object segmentation via attentive graph neural networks

W Wang, X Lu, J Shen… - Proceedings of the …, 2019 - openaccess.thecvf.com

This work proposes a novel attentive graph neural network (AGNN) for zero-shot video
object segmentation (ZVOS). The suggested AGNN recasts this task as a process of iterative …

被引用次数：320 相关文章所有 11 个版本

[PDF] arxiv.org

Learn to pay attention

S Jetley, NA Lord, N Lee, PHS Torr - arXiv preprint arXiv:1804.02391, 2018 - arxiv.org

We propose an end-to-end-trainable attention module for convolutional neural network
(CNN) architectures built for image classification. The module takes as input the 2D feature …

被引用次数：547 相关文章所有 6 个版本

[PDF] thecvf.com

Convolutional neural network architecture for geometric matching

I Rocco, R Arandjelovic, J Sivic - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We address the problem of determining correspondences between two images in
agreement with a geometric model such as an affine or thin-plate spline transformation, and …

被引用次数：651 相关文章所有 17 个版本

[PDF] thecvf.com

Crnet: Cross-reference networks for few-shot segmentation

W Liu, C Zhang, G Lin, F Liu - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com

Over the past few years, state-of-the-art image segmentation algorithms are based on deep
convolutional neural networks. To render a deep network with the ability to understand a …

被引用次数：247 相关文章所有 8 个版本