Lasot: A high-quality benchmark for large-scale single object tracking

N Le, VS Rathour, K Yamazaki, K Luu… - Artificial Intelligence …, 2022 - Springer

Deep reinforcement learning augments the reinforcement learning framework and utilizes
the powerful representation of deep neural networks. Recent works have demonstrated the …

被引用次数：157 相关文章所有 10 个版本

Applications of convolutional neural networks for intelligent waste identification and recycling: A review

TW Wu, H Zhang, W Peng, F Lü, PJ He - Resources, Conservation and …, 2023 - Elsevier

With the implementations of “Zero Waste” and Industry 4.0, the rapidly increasing
applications of artificial intelligence in waste management have generated a large amount of …

被引用次数：51 相关文章所有 2 个版本

[PDF] thecvf.com

Mixformer: End-to-end tracking with iterative mixed attention

Y Cui, C Jiang, L Wang, G Wu - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Tracking often uses a multi-stage pipeline of feature extraction, target information
integration, and bounding box estimation. To simplify this pipeline and unify the process of …

被引用次数：409 相关文章所有 11 个版本

[PDF] thecvf.com

Seqtrack: Sequence to sequence learning for visual object tracking

X Chen, H Peng, D Wang, H Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In this paper, we present a new sequence-to-sequence learning framework for visual
tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem …

被引用次数：98 相关文章所有 5 个版本

[PDF] arxiv.org

Joint feature learning and relation modeling for tracking: A one-stream framework

B Ye, H Chang, B Ma, S Shan, X Chen - European Conference on …, 2022 - Springer

The current popular two-stream, two-stage tracking framework extracts the template and the
search region features separately and then performs relation modeling, thus the extracted …

被引用次数：305 相关文章所有 5 个版本

[PDF] thecvf.com

Universal instance perception as object discovery and retrieval

B Yan, Y Jiang, J Wu, D Wang, P Luo… - Proceedings of the …, 2023 - openaccess.thecvf.com

All instance perception tasks aim at finding certain objects specified by some queries such
as category names, language expressions, and target annotations, but this complete field …

被引用次数：99 相关文章所有 5 个版本

[PDF] thecvf.com

Visual prompt multi-modal tracking

J Zhu, S Lai, X Chen, D Wang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Visible-modal object tracking gives rise to a series of downstream multi-modal tracking
tributaries. To inherit the powerful representations of the foundation model, a natural modus …

被引用次数：106 相关文章所有 6 个版本

[PDF] arxiv.org

Aiatrack: Attention in attention for transformer visual tracking

S Gao, C Zhou, C Ma, X Wang, J Yuan - European Conference on …, 2022 - Springer

Transformer trackers have achieved impressive advancements recently, where the attention
mechanism plays an important role. However, the independent correlation computation in …

被引用次数：183 相关文章所有 5 个版本

[PDF] thecvf.com

Transforming model prediction for tracking

C Mayer, M Danelljan, G Bhat, M Paul… - Proceedings of the …, 2022 - openaccess.thecvf.com

Optimization based tracking methods have been widely successful by integrating a target
model prediction module, providing effective global reasoning by minimizing an objective …

被引用次数：241 相关文章所有 8 个版本

[PDF] thecvf.com

Ego4d: Around the world in 3,000 hours of egocentric video

K Grauman, A Westbury, E Byrne… - Proceedings of the …, 2022 - openaccess.thecvf.com

We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It
offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household …

被引用次数：652 相关文章所有 13 个版本