Learning activity progression in lstms for activity detection and early detection

G Van Houdt, C Mosquera, G Nápoles - Artificial Intelligence Review, 2020 - Springer

Long short-term memory (LSTM) has transformed both machine learning and
neurocomputing fields. According to several online sources, this model has improved …

被引用次数：1205 相关文章所有 9 个版本

[HTML] sciencedirect.com

[HTML][HTML] Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges

HF Nweke, YW Teh, MA Al-Garadi, UR Alo - Expert Systems with …, 2018 - Elsevier

Human activity recognition systems are developed as part of a framework to enable
continuous monitoring of human behaviours in the area of ambient assisted living, sports …

被引用次数：924 相关文章所有 2 个版本

[PDF] thecvf.com

Anticipative video transformer

R Girdhar, K Grauman - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Abstract We propose Anticipative Video Transformer (AVT), an end-to-end attention-based
video modeling architecture that attends to the previously observed video in order to …

被引用次数：220 相关文章所有 6 个版本

[PDF] ieee.org

End-to-end temporal action detection with transformer

X Liu, Q Wang, Y Hu, X Tang, S Zhang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …

被引用次数：213 相关文章所有 5 个版本

[HTML] springer.com

[HTML][HTML] Human action recognition and prediction: A survey

Y Kong, Y Fu - International Journal of Computer Vision, 2022 - Springer

Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …

被引用次数：698 相关文章所有 6 个版本

[PDF] thecvf.com

Rethinking the faster r-cnn architecture for temporal action localization

YW Chao, S Vijayanarasimhan… - Proceedings of the …, 2018 - openaccess.thecvf.com

We propose TAL-Net, an improved approach to temporal action localization in video that is
inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key …

被引用次数：781 相关文章所有 9 个版本

[PDF] thecvf.com

Peeking into the future: Predicting future person activities and locations in videos

J Liang, L Jiang, JC Niebles… - Proceedings of the …, 2019 - openaccess.thecvf.com

Deciphering human behaviors to predict their future paths/trajectories and what they would
do from videos is important in many applications. Motivated by this idea, this paper studies …

被引用次数：461 相关文章所有 16 个版本

[PDF] openreview.net

Eidetic 3D LSTM: A model for video prediction and beyond

Y Wang, L Jiang, MH Yang, LJ Li, M Long… - International …, 2018 - openreview.net

Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …

被引用次数：423 相关文章所有 10 个版本

[PDF] thecvf.com

Tall: Temporal activity localization via language query

J Gao, C Sun, Z Yang… - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

This paper focuses on temporal localization of actions from untrimmed videos. Existing
methods typically involve training classifiers for a pre-defined list of actions and applying the …

被引用次数：792 相关文章所有 8 个版本

[PDF] thecvf.com

R-c3d: Region convolutional 3d network for temporal activity detection

H Xu, A Das, K Saenko - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com

We address the problem of activity detection in continuous, untrimmed video streams. This is
a difficult task that requires extracting meaningful spatio-temporal features to capture …

被引用次数：886 相关文章所有 7 个版本