Sdstrack: Self-distillation symmetric adapter learning for multi-modal visual object tracking
Abstract Multimodal Visual Object Tracking (VOT) has recently gained significant attention
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …
due to its robustness. Early research focused on fully fine-tuning RGB-based trackers which …
High performance RGB-Thermal Video Object Detection via hybrid fusion with progressive interaction and temporal-modal difference
Q Wang, Z Tu, C Li, J Tang - Information Fusion, 2025 - Elsevier
Abstract RGB-Thermal Video Object Detection (RGBT VOD) is to localize and classify the
predefined objects in visible and thermal spectrum videos. The key issue in RGBT VOD lies …
predefined objects in visible and thermal spectrum videos. The key issue in RGBT VOD lies …
Unveiling the Limits of Alignment: Multi-modal Dynamic Local Fusion Network and A Benchmark for Unaligned RGBT Video Object Detection
Q Wang, Z Tu, K Wang, L Gu, C Guo - arXiv preprint arXiv:2410.12143, 2024 - arxiv.org
Current RGB-Thermal Video Object Detection (RGBT VOD) methods still depend on
manually aligning data at the image level, which hampers its practical application in real …
manually aligning data at the image level, which hampers its practical application in real …