SDSTrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking

X Hou, J Xing, Y Qian, Y Guo, S Xin… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …

High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal difference

Q Wang, Z Tu, C Li, J Tang - Information Fusion, 2025 - Elsevier
RGB-Thermal Video Object Detection (RGBT VOD) aims to localize and classify the
predefined objects in visible and thermal spectrum videos. The key issue in RGBT VOD lies …

Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection

Q Wang, Z Tu, K Wang, L Gu, C Guo - arXiv preprint arXiv:2410.12143, 2024 - arxiv.org
Current RGB-Thermal Video Object Detection (RGBT VOD) methods still depend on
manually aligning data at the image level, which hampers their practical application in real …