A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

Moviechat: From dense token to sparse memory for long video understanding

E Song, W Chai, G Wang, Y Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently integrating video foundation models and large language models to build a video
understanding system can overcome the limitations of specific pre-defined vision tasks. Yet …

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Aiatrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European Conference on …, 2022 - Springer
Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

Putting the object back into video object segmentation

HK Cheng, SW Oh, B Price, JY Lee… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present Cutie a video object segmentation (VOS) network with object-level memory
reading which puts the object representation from memory back into the video object …

Boosting video object segmentation via space-time correspondence learning

Y Zhang, L Li, W Wang, R Xie… - Proceedings of the …, 2023 - openaccess.thecvf.com
Current top-leading solutions for video object segmentation (VOS) typically follow a
matching-based regime: for each query frame, the segmentation mask is inferred according …

Hierarchical feature alignment network for unsupervised video object segmentation

G Pei, F Shen, Y Yao, GS Xie, Z Tang… - European Conference on …, 2022 - Springer
Optical flow is an easily conceived and precious cue for advancing unsupervised video
object segmentation (UVOS). Most of the previous methods directly extract and fuse the …

Recurrent dynamic embedding for video object segmentation

M Li, L Hu, Z Xiong, B Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract Space-time memory (STM) based video object segmentation (VOS) networks
usually keep increasing memory bank every several frames, which shows excellent …

Per-clip video object segmentation

K Park, S Woo, SW Oh, IS Kweon… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recently, memory-based approaches show promising results on semi-supervised video
object segmentation. These methods predict object masks frame-by-frame with the help of …

Scalable video object segmentation with simplified framework

Q Wu, T Yang, W Wu, AB Chan - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The current popular methods for video object segmentation (VOS) implement feature
matching through several hand-crafted modules that separately perform feature extraction …