Multi-scale based context-aware net for action detection

Z Wang, J Guo, L Zeng, C Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Sonar image object detection is essential in underwater rescue and resource exploration.
Although many convolution neural network (CNN)-based object detection algorithms have …

被引用次数：33 相关文章所有 2 个版本

[PDF] thecvf.com

Improved soccer action spotting using both audio and video streams

B Vanderplaetse, S Dupont - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com

In this paper, we propose a study on multi-modal (audio and video) action spotting and
classification in soccer videos. Action spotting and classification are the tasks that consist in …

被引用次数：52 相关文章所有 10 个版本

[PDF] arxiv.org

Adaptive mutual supervision for weakly-supervised temporal action localization

C Ju, P Zhao, S Chen, Y Zhang, X Zhang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org

Weakly-supervised temporal action localization aims to localize actions from untrimmed long
videos with only video-level category labels. Most previous methods ignore the …

被引用次数：21 相关文章所有 5 个版本

Video representation learning for temporal action detection using global-local attention

Y Tang, Y Zheng, C Wei, K Guo, H Hu, J Liang - Pattern Recognition, 2023 - Elsevier

Video representation is of significant importance for temporal action detection. The two sub-
tasks of temporal action detection, ie, action classification and action localization, have …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

Segtad: Precise temporal action detection via semantic segmentation

C Zhao, M Ramazanova, M Xu, B Ghanem - European Conference on …, 2022 - Springer

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most
existing works draw inspiration from image object detection and tend to reformulate it as a …

被引用次数：8 相关文章所有 7 个版本

Multi-dimensional attention with similarity constraint for weakly-supervised temporal action localization

Z Chen, H Liu, L Zhang, X Liao - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Weakly-supervised temporal action localization (WTAL) is a challenging task in
understanding untrimmed videos, in which no frame-wise annotation is provided during …

被引用次数：8 相关文章所有 2 个版本

Diffusion-based framework for weakly-supervised temporal action localization

Y Zou, Q Zhao, PK Sarker, S Li, L Wang, W Liu - Pattern Recognition, 2025 - Elsevier

Weakly supervised temporal action localization aims to localize action instances with only
video-level supervision. Due to the absence of frame-level annotation supervision, how …

Semi-supervised temporal action proposal generation via exploiting 2-D proposal map

W Wang, T Lin, D He, F Li, S Wen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Temporal action proposal generation aims to generate temporal video segments containing
human actions in untrimmed videos, which is always a preliminary for such video …

被引用次数：7 相关文章所有 2 个版本

[PDF] mlr.press

Temporal RPN Learning for Weakly-Supervised Temporal Action Localization

J Huang, M Kong, L Chen, T Liang… - Asian Conference on …, 2024 - proceedings.mlr.press

Abstract Weakly-Supervised Temporal Action Localization (WSTAL) aims to train an action
instance localization model from untrimmed videos with only video-level labels, similar to the …

[PDF] arxiv.org

Towards Student Actions in Classroom Scenes: New Dataset and Baseline

Z Tan, C Gao, A Qin, R Chen, T Song, F Yang… - arXiv preprint arXiv …, 2024 - arxiv.org

Analyzing student actions is an important and challenging task in educational research.
Existing efforts have been hampered by the lack of accessible datasets to capture the …