[HTML][HTML] A review on the long short-term memory model
Long short-term memory (LSTM) has transformed both machine learning and
neurocomputing fields. According to several online sources, this model has improved …
neurocomputing fields. According to several online sources, this model has improved …
[HTML][HTML] Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges
Human activity recognition systems are developed as part of a framework to enable
continuous monitoring of human behaviours in the area of ambient assisted living, sports …
continuous monitoring of human behaviours in the area of ambient assisted living, sports …
Anticipative video transformer
Abstract We propose Anticipative Video Transformer (AVT), an end-to-end attention-based
video modeling architecture that attends to the previously observed video in order to …
video modeling architecture that attends to the previously observed video in order to …
End-to-end temporal action detection with transformer
Temporal action detection (TAD) aims to determine the semantic label and the temporal
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
interval of every action instance in an untrimmed video. It is a fundamental and challenging …
[HTML][HTML] Human action recognition and prediction: A survey
Derived from rapid advances in computer vision and machine learning, video analysis tasks
have been moving from inferring the present state to predicting the future state. Vision-based …
have been moving from inferring the present state to predicting the future state. Vision-based …
Rethinking the faster r-cnn architecture for temporal action localization
YW Chao, S Vijayanarasimhan… - Proceedings of the …, 2018 - openaccess.thecvf.com
We propose TAL-Net, an improved approach to temporal action localization in video that is
inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key …
inspired by the Faster R-CNN object detection framework. TAL-Net addresses three key …
Peeking into the future: Predicting future person activities and locations in videos
Deciphering human behaviors to predict their future paths/trajectories and what they would
do from videos is important in many applications. Motivated by this idea, this paper studies …
do from videos is important in many applications. Motivated by this idea, this paper studies …
Eidetic 3D LSTM: A model for video prediction and beyond
Spatiotemporal predictive learning, though long considered to be a promising self-
supervised feature learning method, seldom shows its effectiveness beyond future video …
supervised feature learning method, seldom shows its effectiveness beyond future video …
Tall: Temporal activity localization via language query
This paper focuses on temporal localization of actions from untrimmed videos. Existing
methods typically involve training classifiers for a pre-defined list of actions and applying the …
methods typically involve training classifiers for a pre-defined list of actions and applying the …
R-c3d: Region convolutional 3d network for temporal activity detection
We address the problem of activity detection in continuous, untrimmed video streams. This is
a difficult task that requires extracting meaningful spatio-temporal features to capture …
a difficult task that requires extracting meaningful spatio-temporal features to capture …