Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation
Image segmentation refers to the process to divide an image into meaningful non-
overlapping regions according to human perception, which has become a classic topic since …
overlapping regions according to human perception, which has become a classic topic since …
A review of co-saliency detection algorithms: Fundamentals, applications, and challenges
Co-saliency detection is a newly emerging and rapidly growing research area in the
computer vision community. As a novel branch of visual saliency, co-saliency detection …
computer vision community. As a novel branch of visual saliency, co-saliency detection …
A tale of two features: Stable diffusion complements dino for zero-shot semantic correspondence
Text-to-image diffusion models have made significant advances in generating and editing
high-quality images. As a result, numerous approaches have explored the ability of diffusion …
high-quality images. As a result, numerous approaches have explored the ability of diffusion …
[PDF][PDF] Deep vit features as dense visual descriptors
We study the use of deep features extracted from a pretrained Vision Transformer (ViT) as
dense visual descriptors. We observe and empirically demonstrate that such features, when …
dense visual descriptors. We observe and empirically demonstrate that such features, when …
Revisiting unreasonable effectiveness of data in deep learning era
The success of deep learning in vision can be attributed to:(a) models with high capacity;(b)
increased computational power; and (c) availability of large-scale labeled data. Since 2012 …
increased computational power; and (c) availability of large-scale labeled data. Since 2012 …
Segmenting objects from relational visual data
In this article, we model a set of pixelwise object segmentation tasks—automatic video
segmentation (AVS), image co-segmentation (ICS) and few-shot semantic segmentation …
segmentation (AVS), image co-segmentation (ICS) and few-shot semantic segmentation …
Zero-shot video object segmentation via attentive graph neural networks
This work proposes a novel attentive graph neural network (AGNN) for zero-shot video
object segmentation (ZVOS). The suggested AGNN recasts this task as a process of iterative …
object segmentation (ZVOS). The suggested AGNN recasts this task as a process of iterative …
Learn to pay attention
We propose an end-to-end-trainable attention module for convolutional neural network
(CNN) architectures built for image classification. The module takes as input the 2D feature …
(CNN) architectures built for image classification. The module takes as input the 2D feature …
Convolutional neural network architecture for geometric matching
We address the problem of determining correspondences between two images in
agreement with a geometric model such as an affine or thin-plate spline transformation, and …
agreement with a geometric model such as an affine or thin-plate spline transformation, and …
Crnet: Cross-reference networks for few-shot segmentation
Over the past few years, state-of-the-art image segmentation algorithms are based on deep
convolutional neural networks. To render a deep network with the ability to understand a …
convolutional neural networks. To render a deep network with the ability to understand a …