Deep learning for visual tracking: A comprehensive survey
SM Marvasti-Zadeh, L Cheng… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
Visual target tracking is one of the most sought-after yet challenging research topics in
computer vision. Given the ill-posed nature of the problem and its popularity in a broad …
computer vision. Given the ill-posed nature of the problem and its popularity in a broad …
New generation deep learning for video object detection: A survey
Video object detection, a basic task in the computer vision field, is rapidly evolving and
widely used. In recent years, deep learning methods have rapidly become widespread in the …
widely used. In recent years, deep learning methods have rapidly become widespread in the …
Mixformer: End-to-end tracking with iterative mixed attention
Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …
integration, and bounding box estimation. To simplify this pipeline and unify the process of …
Transforming model prediction for tracking
Optimization based tracking methods have been widely successful by integrating a target
model prediction module, providing effective global reasoning by minimizing an objective …
model prediction module, providing effective global reasoning by minimizing an objective …
Autoregressive visual tracking
We present ARTrack, an autoregressive framework for visual object tracking. ARTrack
tackles tracking as a coordinate sequence interpretation task that estimates object …
tackles tracking as a coordinate sequence interpretation task that estimates object …
Learning spatio-temporal transformer for visual tracking
In this paper, we present a new tracking architecture with an encoder-decoder transformer
as the key component. The encoder models the global spatio-temporal feature …
as the key component. The encoder models the global spatio-temporal feature …
Transformer meets tracker: Exploiting temporal context for robust visual tracking
In video object tracking, there exist rich temporal contexts among successive frames, which
have been largely overlooked in existing trackers. In this work, we bridge the individual …
have been largely overlooked in existing trackers. In this work, we bridge the individual …
Transformer tracking with cyclic shifting window attention
Transformer architecture has been showing its great strength in visual object tracking, for its
effective attention mechanism. Existing transformer-based approaches adopt the pixel-to …
effective attention mechanism. Existing transformer-based approaches adopt the pixel-to …
Backbone is all your need: A simplified architecture for visual object tracking
Exploiting a general-purpose neural architecture to replace hand-wired designs or inductive
biases has recently drawn extensive interest. However, existing tracking approaches rely on …
biases has recently drawn extensive interest. However, existing tracking approaches rely on …
TCTrack: Temporal contexts for aerial tracking
Temporal contexts among consecutive frames are far from being fully utilized in existing
visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit …
visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit …