A review on machine learning styles in computer vision—techniques and future directions

SV Mahadevkar, B Khemani, S Patil, K Kotecha… - Ieee …, 2022 - ieeexplore.ieee.org
Computer applications have considerably shifted from single data processing to machine
learning in recent years due to the accessibility and availability of massive volumes of data …

[HTML][HTML] Deep learning for video object segmentation: a review

M Gao, F Zheng, JJQ Yu, C Shan, G Ding… - Artificial Intelligence …, 2023 - Springer
As one of the fundamental problems in the field of video understanding, video object
segmentation aims at segmenting objects of interest throughout the given video sequence …

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer
We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com
All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

MOSE: A new dataset for video object segmentation in complex scenes

H Ding, C Liu, S He, X Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video object segmentation (VOS) aims at segmenting a particular object throughout the
entire video clip sequence. The state-of-the-art VOS methods have achieved excellent …

Towards grand unification of object tracking

B Yan, Y Jiang, P Sun, D Wang, Z Yuan, P Luo… - European Conference on …, 2022 - Springer
We present a unified method, termed Unicorn, that can simultaneously solve four tracking
problems (SOT, MOT, VOS, MOTS) with a single network using the same model parameters …

Dropmae: Masked autoencoders with spatial-attention dropout for tracking tasks

Q Wu, T Yang, Z Liu, B Wu, Y Shan… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we study masked autoencoder (MAE) pretraining on videos for matching-
based downstream tasks, including visual object tracking (VOT) and video object …

Visible-thermal UAV tracking: A large-scale benchmark and new baseline

P Zhang, J Zhao, D Wang, H Lu… - Proceedings of the …, 2022 - openaccess.thecvf.com
With the popularity of multi-modal sensors, visible-thermal (RGB-T) object tracking is to
achieve robust performance and wider application scenarios with the guidance of objects' …

Rethinking space-time networks with improved memory coverage for efficient video object segmentation

HK Cheng, YW Tai, CK Tang - Advances in Neural …, 2021 - proceedings.neurips.cc
This paper presents a simple yet effective approach to modeling space-time
correspondences in the context of video object segmentation. Unlike most existing …

Learn to match: Automatic matching network design for visual tracking

Z Zhang, Y Liu, X Wang, B Li… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Siamese tracking has achieved groundbreaking performance in recent years, where the
essence is the efficient matching operator cross-correlation and its variants. Besides the …