Gaussian temporal awareness networks for action localization

E Vahdani, Y Tian - IEEE Transactions on Pattern Analysis and …, 2022 - ieeexplore.ieee.org

Understanding human behavior and activity facilitates advancement of numerous real-world
applications, and is critical for video analysis. Despite the progress of action recognition …

被引用次数：48 相关文章所有 9 个版本

[PDF] arxiv.org

Actionformer: Localizing moments of actions with transformers

CL Zhang, J Wu, Y Li - European Conference on Computer Vision, 2022 - Springer

Self-attention based Transformer models have demonstrated impressive results for image
classification and object detection, and more recently for video understanding. Inspired by …

被引用次数：263 相关文章所有 7 个版本

[PDF] thecvf.com

Tridet: Temporal action detection with relative boundary modeling

D Shi, Y Zhong, Q Cao, L Ma, J Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper, we present a one-stage framework TriDet for temporal action detection.
Existing methods often suffer from imprecise boundary predictions due to the ambiguous …

被引用次数：65 相关文章所有 5 个版本

[PDF] thecvf.com

Learning salient boundary feature for anchor-free temporal action localization

C Lin, C Xu, D Luo, Y Wang, Y Tai… - Proceedings of the …, 2021 - openaccess.thecvf.com

Temporal action localization is an important yet challenging task in video understanding.
Typically, such a task aims at inferring both the action category and localization of the start …

被引用次数：247 相关文章所有 7 个版本

TN-ZSTAD: Transferable network for zero-shot temporal activity detection

L Zhang, X Chang, J Liu, M Luo, Z Li… - … on Pattern Analysis …, 2022 - ieeexplore.ieee.org

An integral part of video analysis and surveillance is temporal activity detection, which
means to simultaneously recognize and localize activities in long untrimmed videos …

被引用次数：105 相关文章所有 7 个版本

[PDF] thecvf.com

G-tad: Sub-graph localization for temporal action detection

M Xu, C Zhao, DS Rojas, A Thabet… - Proceedings of the …, 2020 - openaccess.thecvf.com

Temporal action detection is a fundamental yet challenging task in video understanding.
Video context is a critical cue to effectively detect actions, but current works mainly focus on …

被引用次数：493 相关文章所有 15 个版本

[PDF] arxiv.org

End-to-end temporal action detection with transformer

X Liu, Q Wang, Y Hu, X Tang, S Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …

被引用次数：176 相关文章所有 7 个版本

[PDF] thecvf.com

Asm-loc: Action-aware segment modeling for weakly-supervised temporal action localization

B He, X Yang, L Kang, Z Cheng… - Proceedings of the …, 2022 - openaccess.thecvf.com

Weakly-supervised temporal action localization aims to recognize and localize action
segments in untrimmed videos given only video-level action labels for training. Without the …

被引用次数：83 相关文章所有 7 个版本

[PDF] arxiv.org

Ms-tcn++: Multi-stage temporal convolutional network for action segmentation

S Li, YA Farha, Y Liu, MM Cheng… - IEEE transactions on …, 2020 - ieeexplore.ieee.org

With the success of deep learning in classifying short trimmed videos, more attention has
been focused on temporally segmenting and classifying activities in long untrimmed videos …

被引用次数：243 相关文章所有 9 个版本

[PDF] thecvf.com

Fine-grained temporal contrastive learning for weakly-supervised temporal action localization

J Gao, M Chen, C Xu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

We target at the task of weakly-supervised action localization (WSAL), where only video-
level action labels are available during model training. Despite the recent progress, existing …

被引用次数：72 相关文章所有 7 个版本