A review of convolutional neural network architectures and their optimizations
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …
Deep learning for video object segmentation: a review
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …
segmentation aims at segmenting objects of interest throughout the given video sequence …
Moviechat: From dense token to sparse memory for long video understanding
Recently integrating video foundation models and large language models to build a video
understanding system can overcome the limitations of specific pre-defined vision tasks. Yet …
understanding system can overcome the limitations of specific pre-defined vision tasks. Yet …
Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model
HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
Associating objects with transformers for video object segmentation
This paper investigates how to realize better and more efficient embedding learning to tackle
the semi-supervised video object segmentation under challenging multi-object scenarios …
the semi-supervised video object segmentation under challenging multi-object scenarios …
Putting the object back into video object segmentation
We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …
reading which puts the object representation from memory back into the video object …
Rethinking space-time networks with improved memory coverage for efficient video object segmentation
This paper presents a simple yet effective approach to modeling space-time
correspondences in the context of video object segmentation. Unlike most existing …
correspondences in the context of video object segmentation. Unlike most existing …
Decoupling features in hierarchical propagation for video object segmentation
This paper focuses on developing a more effective method of hierarchical propagation for
semi-supervised Video Object Segmentation (VOS). Based on vision transformers, the …
semi-supervised Video Object Segmentation (VOS). Based on vision transformers, the …
Full-duplex strategy for video object segmentation
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …
Modular interactive video object segmentation: Interaction-to-mask, propagation and difference-aware fusion
We present Modular interactive VOS (MiVOS) framework which decouples interaction-to-
mask and mask propagation, allowing for higher generalizability and better performance …
mask and mask propagation, allowing for higher generalizability and better performance …