VSCode: General Visual Salient and Camouflaged Object Detection with 2D Prompt Learning

Z Luo, N Liu, W Zhao, X Yang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Salient object detection (SOD) and camouflaged object detection (COD) are related yet
distinct binary mapping tasks. These tasks involve multiple modalities sharing …

[HTML][HTML] Improving existing segmentators performance with zero-shot segmentators

L Nanni, D Fusaro, C Fantozzi, A Pretto - Entropy, 2023 - mdpi.com
This paper explores the potential of using the SAM (Segment-Anything Model) segmentator
to enhance the segmentation capability of known methods. SAM is a promptable …

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

J Wu, M Zhong, S Xing, Z Lai, Z Liu, W Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
We present VisionLLM v2, an end-to-end generalist multimodal large model (MLLM) that
unifies visual perception, understanding, and generation within a single framework. Unlike …

Training Spatial-Frequency Visual Prompts and Probabilistic Clusters for Accurate Black-Box Transfer Learning

W Cho, K Kim, S Choi, J Choo - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
Despite the growing prevalence of black-box pre-trained models (PTMs) such as prediction
API services, there remains a significant challenge in directly applying general models to …

Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations

C Lei, J Fan, X Li, T Xiang, A Li, C Zhu… - arXiv preprint arXiv …, 2024 - arxiv.org
Camouflaged Object Segmentation (COS) faces significant challenges due to the scarcity of
annotated data, where meticulous pixel-level annotation is both labor-intensive and costly …

ForgeryTTT: Zero-Shot Image Manipulation Localization with Test-Time Training

W Liu, X Shen, CM Pun, X Cun - arXiv preprint arXiv:2410.04032, 2024 - arxiv.org
Social media is increasingly plagued by realistic fake images, making it hard to trust content.
Previous algorithms to detect these fakes often fail in new, real-world scenarios because …

A Simple yet Effective Network based on Vision Transformer for Camouflaged Object and Salient Object Detection

C Hao, Z Yu, X Liu, J Xu, H Yue, J Yang - arXiv preprint arXiv:2402.18922, 2024 - arxiv.org
Camouflaged object detection (COD) and salient object detection (SOD) are two distinct yet
closely-related computer vision tasks widely studied during the past decades. Though …

GFHANet: Global Feature Hybrid Attention Network for Salient Object Detection in Side-Scan Sonar Images

SA Yuan, Z Wang, FL He, SW Zhang, ZY Zhao - IEEE Access, 2024 - ieeexplore.ieee.org
With the wide application of deep learning in image processing, salient object detection
(SOD) in underwater sonar images has become an important research topic. However, due …

DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions

S Kalwar, M Ungarala, S Jain, A Monis… - arXiv preprint arXiv …, 2023 - arxiv.org
Semantic segmentation in adverse weather scenarios is a critical task for autonomous
driving systems. While foundation models have shown promise, the need for specialized …

External Prompt Features Enhanced Parameter-Efficient Fine-Tuning for Salient Object Detection

W Liang, P Ran, M Bai, X Liu, PB Githinji… - … Conference on Pattern …, 2024 - Springer
Salient object detection (SOD) aims at finding the most salient objects in images and outputs
pixel-level binary masks. Transformer-based methods achieve promising performance due …