MLFFNet: Multilevel feature fusion network for object detection in sonar images
Z Wang, J Guo, L Zeng, C Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Sonar image object detection is essential in underwater rescue and resource exploration.
Although many convolution neural network (CNN)-based object detection algorithms have …
Improved soccer action spotting using both audio and video streams
B Vanderplaetse, S Dupont - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
In this paper, we propose a study on multi-modal (audio and video) action spotting and
classification in soccer videos. Action spotting and classification are the tasks that consist in …
Adaptive mutual supervision for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to localize actions from untrimmed long
videos with only video-level category labels. Most previous methods ignore the …
Video representation learning for temporal action detection using global-local attention
Video representation is of significant importance for temporal action detection. The two
sub-tasks of temporal action detection, i.e., action classification and action localization, have …
Segtad: Precise temporal action detection via semantic segmentation
Temporal action detection (TAD) is an important yet challenging task in video analysis. Most
existing works draw inspiration from image object detection and tend to reformulate it as a …
Multi-dimensional attention with similarity constraint for weakly-supervised temporal action localization
Weakly-supervised temporal action localization (WTAL) is a challenging task in
understanding untrimmed videos, in which no frame-wise annotation is provided during …
Diffusion-based framework for weakly-supervised temporal action localization
Y Zou, Q Zhao, PK Sarker, S Li, L Wang, W Liu - Pattern Recognition, 2025 - Elsevier
Weakly supervised temporal action localization aims to localize action instances with only
video-level supervision. Due to the absence of frame-level annotation supervision, how …
Semi-supervised temporal action proposal generation via exploiting 2-D proposal map
Temporal action proposal generation aims to generate temporal video segments containing
human actions in untrimmed videos, which is always a preliminary for such video …
Temporal RPN Learning for Weakly-Supervised Temporal Action Localization
Weakly-Supervised Temporal Action Localization (WSTAL) aims to train an action
instance localization model from untrimmed videos with only video-level labels, similar to the …
Towards Student Actions in Classroom Scenes: New Dataset and Baseline
Analyzing student actions is an important and challenging task in educational research.
Existing efforts have been hampered by the lack of accessible datasets to capture the …