Deep learning-based action detection in untrimmed videos: A survey
Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …
ASM-Loc: Action-aware segment modeling for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to recognize and localize action
segments in untrimmed videos given only video-level action labels for training. Without the …
Fine-grained temporal contrastive learning for weakly-supervised temporal action localization
We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …
CoLA: Weakly-supervised temporal action localization with snippet contrastive learning
Weakly-supervised temporal action localization (WS-TAL) aims to localize actions in
untrimmed videos with only video-level labels. Most existing models follow the "localization …
Weakly supervised temporal action localization via representative snippet knowledge propagation
Weakly supervised temporal action localization targets at localizing temporal boundaries of
actions and simultaneously identify their categories with only video-level category labels …
Exploring denoised cross-video contrast for weakly-supervised temporal action localization
Weakly-supervised temporal action localization aims to localize actions in untrimmed videos
with only video-level labels. Most existing methods address this problem with a "localization …
Foreground-action consistency network for weakly supervised temporal action localization
As a challenging task of high-level video understanding, weakly supervised temporal action
localization has been attracting increasing attention. With only video annotations, most …
Learning action completeness from points for weakly-supervised temporal action localization
We tackle the problem of localizing temporal intervals of actions with only a single frame
label for each action instance for training. Owing to label sparsity, existing work fails to learn …
Uncertainty guided collaborative training for weakly supervised temporal action detection
Weakly supervised temporal action detection aims to localize temporal boundaries of
actions and identify their categories simultaneously with only video-level category labels …
Action unit memory network for weakly supervised temporal action localization
Weakly supervised temporal action localization aims to detect and localize actions in
untrimmed videos with only video-level labels during training. However, without frame-level …