Pyramid constrained self-attention network for fast video salient object detection

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com

Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (e.g., SwinIR) can significantly improve the model performance but …

Cited by 79 Related articles All 5 versions

Visual semantic segmentation based on few/zero-shot learning: An overview

W Ren, Y Tang, Q Sun, C Zhao… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org

Visual semantic segmentation aims at separating a visual sample into diverse blocks with
specific semantic attributes and identifying the category for each block, and it plays a crucial …

Cited by 22 Related articles All 7 versions

Full-duplex strategy for video object segmentation

GP Ji, K Fu, Z Wu, DP Fan, J Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com

Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …

Cited by 145 Related articles All 13 versions

Siamese network for RGB-D salient object detection and beyond

K Fu, DP Fan, GP Ji, Q Zhao, J Shen… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as
independent information and design separate networks for feature extraction from each …

Cited by 217 Related articles All 8 versions

Video transformers: A survey

J Selva, AS Johansen, S Escalera… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Transformer models have shown great success handling long-range interactions, making
them a promising tool for modeling video. However, they lack inductive biases and scale …

Cited by 97 Related articles All 8 versions

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org

Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

Cited by 129 Related articles All 9 versions

A comprehensive survey on video saliency detection with auditory information: the audio-visual consistency perceptual is the key!

C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Video saliency detection (VSD) aims at fast locating the most attractive
objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …

Cited by 21 Related articles All 5 versions

Dynamic context-sensitive filtering network for video salient object detection

M Zhang, J Liu, Y Wang, Y Piao… - Proceedings of the …, 2021 - openaccess.thecvf.com

The ability to capture inter-frame dynamics has been critical to the development of video
salient object detection (VSOD). While many works have achieved great success in this field …

Cited by 101 Related articles All 4 versions

Video polyp segmentation: A deep learning perspective

GP Ji, G Xiao, YC Chou, DP Fan, K Zhao… - Machine Intelligence …, 2022 - Springer

We present the first comprehensive video polyp segmentation (VPS) study in the deep
learning era. Over the years, developments in VPS are not moving forward with ease due to …

Cited by 78 Related articles All 9 versions

Progressively normalized self-attention network for video polyp segmentation

GP Ji, YC Chou, DP Fan, G Chen, H Fu, D Jha… - … Conference on Medical …, 2021 - Springer

Existing video polyp segmentation (VPS) models typically employ convolutional neural
networks (CNNs) to extract features. However, due to their limited receptive fields, CNNs …

Cited by 133 Related articles All 5 versions