Video semantic segmentation via sparse temporal transformer

G Sun, Y Liu, H Ding, T Probst… - proceedings of the …, 2022 - openaccess.thecvf.com

The contextual information plays a core role in semantic segmentation. As for video
semantic segmentation, the contexts include static contexts and motional contexts …

被引用次数：59 相关文章所有 9 个版本

[PDF] thecvf.com

Isomer: Isomerous transformer for zero-shot video object segmentation

Y Yuan, Y Wang, L Wang, X Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent leading zero-shot video object segmentation (ZVOS) works devote to integrating
appearance and motion information by elaborately designing feature fusion modules and …

被引用次数：9 相关文章所有 7 个版本

[PDF] thecvf.com

Multispectral video semantic segmentation: A benchmark dataset and baseline

W Ji, J Li, C Bian, Z Zhou, J Zhao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …

被引用次数：19 相关文章所有 4 个版本

[PDF] thecvf.com

Neural video depth stabilizer

Y Wang, M Shi, J Li, Z Huang, Z Cao… - Proceedings of the …, 2023 - openaccess.thecvf.com

Video depth estimation aims to infer temporally consistent depth. Some methods achieve
temporal consistency by finetuning a single-image depth model during test time using …

被引用次数：13 相关文章所有 6 个版本

[PDF] arxiv.org

Mining relations among cross-frame affinities for video semantic segmentation

G Sun, Y Liu, H Tang, A Chhatkuli, L Zhang… - … on Computer Vision, 2022 - Springer

The essence of video semantic segmentation (VSS) is how to leverage temporal information
for prediction. Previous efforts are mainly devoted to developing new techniques to calculate …

被引用次数：31 相关文章所有 9 个版本

[PDF] neurips.cc

Mask propagation for efficient video semantic segmentation

Y Weng, M Han, H He, M Li, L Yao… - Advances in …, 2024 - proceedings.neurips.cc

Abstract Video Semantic Segmentation (VSS) involves assigning a semantic label to each
pixel in a video sequence. Prior work in this field has demonstrated promising results by …

被引用次数：8 相关文章所有 6 个版本

[PDF] thecvf.com

Latency matters: Real-time action forecasting transformer

H Girase, N Agarwal, C Choi… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …

被引用次数：13 相关文章所有 3 个版本

[PDF] thecvf.com

Combining implicit-explicit view correlation for light field semantic segmentation

R Cong, D Yang, R Chen, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Since light field simultaneously records spatial information and angular information of light
rays, it is considered to be beneficial for many potential applications, and semantic …

被引用次数：18 相关文章所有 3 个版本

[PDF] thecvf.com

Vanishing-point-guided video semantic segmentation of driving scenes

D Guo, DP Fan, T Lu, C Sakaridis… - Proceedings of the …, 2024 - openaccess.thecvf.com

The estimation of implicit cross-frame correspondences and the high computational cost
have long been major challenges in video semantic segmentation (VSS) for driving scenes …

被引用次数：1 相关文章所有 5 个版本

[PDF] thecvf.com

Motion-state Alignment for Video Semantic Segmentation

J Su, R Yin, S Zhang, J Luo - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

In recent years, video semantic segmentation has made great progress with advanced deep
neural networks. However, there still exist two main challenges ie, information inconsistency …

被引用次数：8 相关文章所有 5 个版本