MLFFNet: Multilevel feature fusion network for object detection in sonar images

Z Wang, J Guo, L Zeng, C Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Sonar image object detection is essential in underwater rescue and resource exploration.
Although many convolution neural network (CNN)-based object detection algorithms have …

Improved soccer action spotting using both audio and video streams

B Vanderplaetse, S Dupont - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
In this paper, we propose a study on multi-modal (audio and video) action spotting and
classification in soccer videos. Action spotting and classification are the tasks that consist in …

Adaptive mutual supervision for weakly-supervised temporal action localization

C Ju, P Zhao, S Chen, Y Zhang, X Zhang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Weakly-supervised temporal action localization aims to localize actions from untrimmed long
videos with only video-level category labels. Most previous methods ignore the …

Video representation learning for temporal action detection using global-local attention

Y Tang, Y Zheng, C Wei, K Guo, H Hu, J Liang - Pattern Recognition, 2023 - Elsevier
Video representation is of significant importance for temporal action detection. The two sub-
tasks of temporal action detection, ie, action classification and action localization, have …

Segtad: Precise temporal action detection via semantic segmentation

C Zhao, M Ramazanova, M Xu, B Ghanem - European Conference on …, 2022 - Springer
Temporal action detection (TAD) is an important yet challenging task in video analysis. Most
existing works draw inspiration from image object detection and tend to reformulate it as a …

Multi-dimensional attention with similarity constraint for weakly-supervised temporal action localization

Z Chen, H Liu, L Zhang, X Liao - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Weakly-supervised temporal action localization (WTAL) is a challenging task in
understanding untrimmed videos, in which no frame-wise annotation is provided during …

Diffusion-based framework for weakly-supervised temporal action localization

Y Zou, Q Zhao, PK Sarker, S Li, L Wang, W Liu - Pattern Recognition, 2025 - Elsevier
Weakly supervised temporal action localization aims to localize action instances with only
video-level supervision. Due to the absence of frame-level annotation supervision, how …

Semi-supervised temporal action proposal generation via exploiting 2-D proposal map

W Wang, T Lin, D He, F Li, S Wen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Temporal action proposal generation aims to generate temporal video segments containing
human actions in untrimmed videos, which is always a preliminary for such video …

Temporal RPN Learning for Weakly-Supervised Temporal Action Localization

J Huang, M Kong, L Chen, T Liang… - Asian Conference on …, 2024 - proceedings.mlr.press
Abstract Weakly-Supervised Temporal Action Localization (WSTAL) aims to train an action
instance localization model from untrimmed videos with only video-level labels, similar to the …

Towards Student Actions in Classroom Scenes: New Dataset and Baseline

Z Tan, C Gao, A Qin, R Chen, T Song, F Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Analyzing student actions is an important and challenging task in educational research.
Existing efforts have been hampered by the lack of accessible datasets to capture the …