A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

Deep learning for video object segmentation: a review

M Gao, F Zheng, JJQ Yu, C Shan, G Ding… - Artificial Intelligence …, 2023 - Springer
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …

MovieChat: From dense token to sparse memory for long video understanding

E Song, W Chai, G Wang, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently, integrating video foundation models and large language models to build a video
understanding system can overcome the limitations of specific pre-defined vision tasks. Yet …

XMem: Long-term video object segmentation with an Atkinson-Shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Associating objects with transformers for video object segmentation

Z Yang, Y Wei, Y Yang - Advances in Neural Information …, 2021 - proceedings.neurips.cc
This paper investigates how to realize better and more efficient embedding learning to tackle
the semi-supervised video object segmentation under challenging multi-object scenarios …

Putting the object back into video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present Cutie, a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …

Rethinking space-time networks with improved memory coverage for efficient video object segmentation

HK Cheng, YW Tai, CK Tang - Advances in Neural …, 2021 - proceedings.neurips.cc
This paper presents a simple yet effective approach to modeling space-time
correspondences in the context of video object segmentation. Unlike most existing …

Decoupling features in hierarchical propagation for video object segmentation

Z Yang, Y Yang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
This paper focuses on developing a more effective method of hierarchical propagation for
semi-supervised Video Object Segmentation (VOS). Based on vision transformers, the …

Full-duplex strategy for video object segmentation

GP Ji, K Fu, Z Wu, DP Fan, J Shen… - Proceedings of the …, 2021 - openaccess.thecvf.com
Appearance and motion are two important sources of information in video object
segmentation (VOS). Previous methods mainly focus on using simplex solutions, lowering …

Modular interactive video object segmentation: Interaction-to-mask, propagation and difference-aware fusion

HK Cheng, YW Tai, CK Tang - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
We present the Modular interactive VOS (MiVOS) framework, which decouples interaction-to-mask
and mask propagation, allowing for higher generalizability and better performance …