Deep learning for visual tracking: A comprehensive survey

SM Marvasti-Zadeh, L Cheng… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Visual target tracking is one of the most sought-after yet challenging research topics in
computer vision. Given the ill-posed nature of the problem and its popularity in a broad …

New generation deep learning for video object detection: A survey

L Jiao, R Zhang, F Liu, S Yang, B Hou… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Video object detection, a basic task in the computer vision field, is rapidly evolving and
widely used. In recent years, deep learning methods have rapidly become widespread in the …

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

Transforming model prediction for tracking

C Mayer, M Danelljan, G Bhat, M Paul… - Proceedings of the …, 2022 - openaccess.thecvf.com
Optimization based tracking methods have been widely successful by integrating a target
model prediction module, providing effective global reasoning by minimizing an objective …

Autoregressive visual tracking

X Wei, Y Bai, Y Zheng, D Shi… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present ARTrack, an autoregressive framework for visual object tracking. ARTrack
tackles tracking as a coordinate sequence interpretation task that estimates object …

Learning spatio-temporal transformer for visual tracking

B Yan, H Peng, J Fu, D Wang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we present a new tracking architecture with an encoder-decoder transformer
as the key component. The encoder models the global spatio-temporal feature …

Transformer meets tracker: Exploiting temporal context for robust visual tracking

N Wang, W Zhou, J Wang, H Li - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In video object tracking, there exist rich temporal contexts among successive frames, which
have been largely overlooked in existing trackers. In this work, we bridge the individual …

Transformer tracking with cyclic shifting window attention

Z Song, J Yu, YPP Chen… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Transformer architecture has been showing its great strength in visual object tracking, for its
effective attention mechanism. Existing transformer-based approaches adopt the pixel-to …

Backbone is all your need: A simplified architecture for visual object tracking

B Chen, P Li, L Bai, L Qiao, Q Shen, B Li, W Gan… - … on Computer Vision, 2022 - Springer
Exploiting a general-purpose neural architecture to replace hand-wired designs or inductive
biases has recently drawn extensive interest. However, existing tracking approaches rely on …

TCTrack: Temporal contexts for aerial tracking

Z Cao, Z Huang, L Pan, S Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Temporal contexts among consecutive frames are far from being fully utilized in existing
visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit …