Srformer: Permuted self-attention for single image super-resolution

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …

Visual semantic segmentation based on few/zero-shot learning: An overview

W Ren, Y Tang, Q Sun, C Zhao… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Visual semantic segmentation aims at separating a visual sample into diverse blocks with
specific semantic attributes and identifying the category for each block, and it plays a crucial …

Full-duplex strategy for video object segmentation

GP Ji, K Fu, Z Wu, DP Fan, J Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …

Siamese network for RGB-D salient object detection and beyond

K Fu, DP Fan, GP Ji, Q Zhao, J Shen… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Existing RGB-D salient object detection (SOD) models usually treat RGB and depth as
independent information and design separate networks for feature extraction from each …

Video transformers: A survey

J Selva, AS Johansen, S Escalera… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Transformer models have shown great success handling long-range interactions, making
them a promising tool for modeling video. However, they lack inductive biases and scale …

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

A comprehensive survey on video saliency detection with auditory information: the audio-visual consistency perceptual is the key!

C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency detection (VSD) aims at fast locating the most attractive
objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …

Dynamic context-sensitive filtering network for video salient object detection

M Zhang, J Liu, Y Wang, Y Piao… - Proceedings of the …, 2021 - openaccess.thecvf.com
The ability to capture inter-frame dynamics has been critical to the development of video
salient object detection (VSOD). While many works have achieved great success in this field …

Video polyp segmentation: A deep learning perspective

GP Ji, G Xiao, YC Chou, DP Fan, K Zhao… - Machine Intelligence …, 2022 - Springer
We present the first comprehensive video polyp segmentation (VPS) study in the deep
learning era. Over the years, developments in VPS are not moving forward with ease due to …

Progressively normalized self-attention network for video polyp segmentation

GP Ji, YC Chou, DP Fan, G Chen, H Fu, D Jha… - … Conference on Medical …, 2021 - Springer
Existing video polyp segmentation (VPS) models typically employ convolutional neural
networks (CNNs) to extract features. However, due to their limited receptive fields, CNNs …