A review on machine learning styles in computer vision—techniques and future directions
Computer applications have considerably shifted from single data processing to machine
learning in recent years due to the accessibility and availability of massive volumes of data …
learning in recent years due to the accessibility and availability of massive volumes of data …
[HTML][HTML] Deep learning for video object segmentation: a review
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …
segmentation aims at segmenting objects of interest throughout the given video sequence …
Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model
HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …
Universal instance perception as object discovery and retrieval
All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …
as category names, language expressions, and target annotations, but this complete field …
MOSE: A new dataset for video object segmentation in complex scenes
Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …
Towards grand unification of object tracking
We present a unified method, termed Unicorn, that can simultaneously solve four tracking
problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters …
problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters …
Dropmae: Masked autoencoders with spatial-attention dropout for tracking tasks
In this paper, we study masked autoencoder (MAE) pretraining on videos for matching-
based downstream tasks, including visual object tracking (VOT) and video object …
based downstream tasks, including visual object tracking (VOT) and video object …
Visible-thermal UAV tracking: A large-scale benchmark and new baseline
With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to
achieve robust performance and wider application scenarios with the guidance of objects' …
achieve robust performance and wider application scenarios with the guidance of objects' …
Rethinking space-time networks with improved memory coverage for efficient video object segmentation
This paper presents a simple yet effective approach to modeling space-time
correspondences in the context of video object segmentation. Unlike most existing …
correspondences in the context of video object segmentation. Unlike most existing …
Learn to match: Automatic matching network design for visual tracking
Siamese tracking has achieved groundbreaking performance in recent years, where the
essence is the efficient matching operator cross-correlation and its variants. Besides the …
essence is the efficient matching operator cross-correlation and its variants. Besides the …