Joint inductive and transductive learning for video object segmentation

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer

The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

被引用次数：165 相关文章所有 5 个版本

[PDF] thecvf.com

Moviechat: From dense token to sparse memory for long video understanding

E Song, W Chai, G Wang, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

Recently integrating video foundation models and large language models to build a video
understanding system can overcome the limitations of specific pre-defined vision tasks. Yet …

被引用次数：165 相关文章所有 3 个版本

[PDF] arxiv.org

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer

We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

被引用次数：385 相关文章所有 8 个版本

[PDF] arxiv.org

Aiatrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European Conference on …, 2022 - Springer

Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

被引用次数：271 相关文章所有 5 个版本

[PDF] thecvf.com

Putting the object back into video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com

We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …

被引用次数：67 相关文章所有 4 个版本

[PDF] thecvf.com

Boosting video object segmentation via space-time correspondence learning

Y Zhang, L Li, W Wang, R Xie… - Proceedings of the …, 2023 - openaccess.thecvf.com

Current top-leading solutions for video object segmentation (VOS) typically follow a
matching-based regime: for each query frame, the segmentation mask is inferred according …

被引用次数：37 相关文章所有 6 个版本

[PDF] arxiv.org

Hierarchical feature alignment network for unsupervised video object segmentation

G Pei, F Shen, Y Yao, GS Xie, Z Tang… - European Conference on …, 2022 - Springer

Optical flow is an easily conceived and precious cue for advancing unsupervised video
object segmentation (UVOS). Most of the previous methods directly extract and fuse the …

被引用次数：73 相关文章所有 5 个版本

[PDF] thecvf.com

Recurrent dynamic embedding for video object segmentation

M Li, L Hu, Z Xiong, B Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract Space-time memory (STM) based video object segmentation (VOS) networks
usually keep increasing memory bank every several frames, which shows excellent …

被引用次数：78 相关文章所有 5 个版本

[PDF] thecvf.com

Per-clip video object segmentation

K Park, S Woo, SW Oh, IS Kweon… - Proceedings of the …, 2022 - openaccess.thecvf.com

Recently, memory-based approaches show promising results on semi-supervised video
object segmentation. These methods predict object masks frame-by-frame with the help of …

被引用次数：62 相关文章所有 8 个版本

[PDF] thecvf.com

Scalable video object segmentation with simplified framework

Q Wu, T Yang, W Wu, AB Chan - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

The current popular methods for video object segmentation (VOS) implement feature
matching through several hand-crafted modules that separately perform feature extraction …

被引用次数：21 相关文章所有 6 个版本