Breaking Modality Gap in RGBT Tracking: Coupled Knowledge Distillation

A Lu, J Zhao, C Li, Y Xiao, B Luo - ACM Multimedia 2024, 2024 - openreview.net
Modality gap between RGB and thermal infrared (TIR) images is a crucial issue but often
overlooked in existing RGBT tracking methods. It can be observed that modality gap mainly …

MambaVT: Spatio-Temporal Contextual Modeling for robust RGB-T Tracking

S Lai, C Liu, J Zhu, B Kang, Y Liu, D Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing RGB-T tracking algorithms have made remarkable progress by leveraging the
global interaction capability and extensive pre-trained models of the Transformer …

Top-down Cross-modal Guidance for Robust RGB-T Tracking

L Chen, B Zhong, Q Liang, Y Zheng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Most RGB-T trackers heavily rely on bottom-up attention and thus overlook top-down
cross-modal guidance for learning target features. Consequently, the discriminative power of the …

Exploring Multi-modal Spatial-Temporal Contexts for High-performance RGB-T Tracking

T Zhang, Q Jiao, Q Zhang, J Han - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
In RGB-T tracking, there exist rich spatial relationships between the target and backgrounds
within multi-modal data as well as sound consistencies of spatial relationships among …

Cross-modulated Attention Transformer for RGBT Tracking

Y Xiao, J Zhao, A Lu, C Li, Y Lin, B Yin, C Liu - arXiv preprint arXiv …, 2024 - arxiv.org
Existing Transformer-based RGBT trackers achieve remarkable performance benefits by
leveraging self-attention to extract uni-modal features and cross-attention to enhance multi …

Middle fusion and multi-stage, multi-form prompts for robust RGB-T tracking

Q Wang, Y Bai, H Song - Neurocomputing, 2024 - Elsevier
RGB-T tracking, a vital downstream task of object tracking, has made remarkable progress in
recent years. Yet, it remains hindered by two major challenges: (1) the trade-off between …

Towards a Generalist and Blind RGB-X Tracker

Y Tan, Z Wu, Y Fu, Z Zhou, G Sun, C Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
With the emergence of a single large model capable of successfully solving a multitude of
tasks in NLP, there has been growing research interest in achieving similar goals in …

AFter: Attention-based Fusion Router for RGBT Tracking

A Lu, W Wang, C Li, J Tang, B Luo - arXiv preprint arXiv:2405.02717, 2024 - arxiv.org
Multi-modal feature fusion as a core investigative component of RGBT tracking emerges
numerous fusion studies in recent years. However, existing RGBT tracking methods widely …

RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba

A Lu, W Wang, C Li, J Tang, B Luo - arXiv preprint arXiv:2408.08827, 2024 - arxiv.org
Existing RGBT tracking methods often design various interaction models to perform
cross-modal fusion of each layer, but cannot execute the feature interactions among all layers …

Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach

Y Zhu, Q Wang, C Li, J Tang, Z Huang - arXiv preprint arXiv:2408.00969, 2024 - arxiv.org
The complementary benefits from visible and thermal infrared data are widely utilized in
various computer vision tasks, such as visual tracking, semantic segmentation and object …