Coarse-to-fine feature mining for video semantic segmentation
The contextual information plays a core role in semantic segmentation. As for video
semantic segmentation, the contexts include static contexts and motional contexts …
semantic segmentation, the contexts include static contexts and motional contexts …
Isomer: Isomerous transformer for zero-shot video object segmentation
Recent leading zero-shot video object segmentation (ZVOS) works devote to integrating
appearance and motion information by elaborately designing feature fusion modules and …
appearance and motion information by elaborately designing feature fusion modules and …
Multispectral video semantic segmentation: A benchmark dataset and baseline
Robust and reliable semantic segmentation in complex scenes is crucial for many real-life
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
applications such as autonomous safe driving and nighttime rescue. In most approaches, it …
Neural video depth stabilizer
Video depth estimation aims to infer temporally consistent depth. Some methods achieve
temporal consistency by finetuning a single-image depth model during test time using …
temporal consistency by finetuning a single-image depth model during test time using …
Mining relations among cross-frame affinities for video semantic segmentation
The essence of video semantic segmentation (VSS) is how to leverage temporal information
for prediction. Previous efforts are mainly devoted to developing new techniques to calculate …
for prediction. Previous efforts are mainly devoted to developing new techniques to calculate …
Mask propagation for efficient video semantic segmentation
Abstract Video Semantic Segmentation (VSS) involves assigning a semantic label to each
pixel in a video sequence. Prior work in this field has demonstrated promising results by …
pixel in a video sequence. Prior work in this field has demonstrated promising results by …
Latency matters: Real-time action forecasting transformer
We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …
world action forecasting applications. RAFTformer is a two-stage fully transformer based …
Combining implicit-explicit view correlation for light field semantic segmentation
Since light field simultaneously records spatial information and angular information of light
rays, it is considered to be beneficial for many potential applications, and semantic …
rays, it is considered to be beneficial for many potential applications, and semantic …
Vanishing-point-guided video semantic segmentation of driving scenes
The estimation of implicit cross-frame correspondences and the high computational cost
have long been major challenges in video semantic segmentation (VSS) for driving scenes …
have long been major challenges in video semantic segmentation (VSS) for driving scenes …
Motion-state Alignment for Video Semantic Segmentation
In recent years, video semantic segmentation has made great progress with advanced deep
neural networks. However, there still exist two main challenges ie, information inconsistency …
neural networks. However, there still exist two main challenges ie, information inconsistency …