RGB-D data-based action recognition: a review

MB Shaikh, D Chai - Sensors, 2021 - mdpi.com
Classification of human actions is an ongoing research problem in computer vision. This
review is aimed to scope current literature on data fusion and action recognition techniques …

G-tad: Sub-graph localization for temporal action detection

M Xu, C Zhao, DS Rojas, A Thabet… - Proceedings of the …, 2020 - openaccess.thecvf.com
Temporal action detection is a fundamental yet challenging task in video understanding.
Video context is a critical cue to effectively detect actions, but current works mainly focus on …

Action-net: Multipath excitation for action recognition

Z Wang, Q She, A Smolic - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Abstract Spatial-temporal, channel-wise, and motion patterns are three complementary and
crucial types of information for video action recognition. Conventional 2D CNNs are …

Tam: Temporal adaptive module for video recognition

Z Liu, L Wang, W Wu, C Qian… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Video data is with complex temporal dynamics due to various factors such as camera
motion, speed variation, and different activities. To effectively capture this diverse motion …

Meteornet: Deep learning on dynamic 3d point cloud sequences

X Liu, M Yan, J Bohg - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Understanding dynamic 3D environment is crucial for robotic agents and many other
applications. We propose a novel neural network architecture called MeteorNet for learning …

Unsupervised learning of accurate siamese tracking

Q Shen, L Qiao, J Guo, P Li, X Li, B Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
Unsupervised learning has been popular in various computer vision tasks, including visual
object tracking. However, prior unsupervised tracking approaches rely heavily on spatial …

Motionsqueeze: Neural motion feature learning for video understanding

H Kwon, M Kim, S Kwak, M Cho - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Motion plays a crucial role in understanding videos and most state-of-the-art neural models
for video classification incorporate motion information typically using optical flows extracted …

Stand-alone inter-frame attention in video models

F Long, Z Qiu, Y Pan, T Yao, J Luo… - Proceedings of the …, 2022 - openaccess.thecvf.com
Motion, as the uniqueness of a video, has been critical to the development of video
understanding models. Modern deep learning models leverage motion by either executing …

Causal contextual prediction for learned image compression

Z Guo, Z Zhang, R Feng, Z Chen - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Over the past several years, we have witnessed impressive progress in the field of learned
image compression. Recent learned image codecs are commonly based on autoencoders …

Social adaptive module for weakly-supervised group activity recognition

R Yan, L Xie, J Tang, X Shu, Q Tian - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
This paper presents a new task named weakly-supervised group activity recognition (GAR)
which differs from conventional GAR tasks in that only video-level labels are available, yet …