Optical flow guided feature: A fast and robust motion representation for video action recognition

T Ahmad, L Jin, X Zhang, S Lai… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Video-based human action recognition is one of the most important and challenging areas
of research in the field of computer vision. Human action recognition has found many …

被引用次数：80 相关文章所有 4 个版本

Transformer for skeleton-based action recognition: A review of recent advances

W Xin, R Liu, Y Liu, Y Chen, W Yu, Q Miao - Neurocomputing, 2023 - Elsevier

Skeleton-based action recognition has rapidly become one of the most popular and
essential research topics in computer vision. The task is to analyze the characteristics of …

被引用次数：30 相关文章所有 2 个版本

[PDF] arxiv.org

Flowformer: A transformer architecture for optical flow

Z Huang, X Shi, C Zhang, Q Wang, KC Cheung… - European conference on …, 2022 - Springer

We introduce optical Flow transFormer, dubbed as FlowFormer, a transformer-based neural
network architecture for learning optical flow. FlowFormer tokenizes the 4D cost volume built …

被引用次数：241 相关文章所有 5 个版本

[PDF] thecvf.com

Flowformer++: Masked cost volume autoencoding for pretraining optical flow estimation

X Shi, Z Huang, D Li, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …

被引用次数：58 相关文章所有 5 个版本

[PDF] thecvf.com

X3d: Expanding architectures for efficient video recognition

C Feichtenhofer - Proceedings of the IEEE/CVF conference …, 2020 - openaccess.thecvf.com

This paper presents X3D, a family of efficient video networks that progressively expand a
tiny 2D image classification architecture along multiple network axes, in space, time, width …

被引用次数：1068 相关文章所有 7 个版本

[PDF] arxiv.org

Real-time intermediate flow estimation for video frame interpolation

Z Huang, T Zhang, W Heng, B Shi, S Zhou - European Conference on …, 2022 - Springer

Real-time video frame interpolation (VFI) is very useful in video processing, media players,
and display devices. We propose RIFE, a Real-time Intermediate Flow Estimation algorithm …

被引用次数：313 相关文章所有 7 个版本

[HTML] springer.com

[HTML][HTML] Vision-based human activity recognition: a survey

DR Beddiar, B Nini, M Sabokrou, A Hadid - Multimedia Tools and …, 2020 - Springer

Human activity recognition (HAR) systems attempt to automatically identify and analyze
human activities using acquired information from various types of sensors. Although several …

被引用次数：401 相关文章所有 8 个版本

[PDF] thecvf.com

Craft: Cross-attentional flow transformer for robust optical flow

X Sui, S Li, X Geng, Y Wu, X Xu, Y Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com

Optical flow estimation aims to find the 2D motion field by identifying corresponding pixels
between two images. Despite the tremendous progress of deep learning-based optical flow …

被引用次数：97 相关文章所有 7 个版本

[PDF] thecvf.com

Stm: Spatiotemporal and motion encoding for action recognition

B Jiang, MM Wang, W Gan, W Wu… - Proceedings of the …, 2019 - openaccess.thecvf.com

Spatiotemporal and motion features are two complementary and crucial information for
video action recognition. Recent state-of-the-art methods adopt a 3D CNN stream to learn …

被引用次数：497 相关文章所有 6 个版本

[PDF] arxiv.org

A comprehensive study of deep video action recognition

Y Zhu, X Li, C Liu, M Zolfaghari, Y Xiong, C Wu… - arXiv preprint arXiv …, 2020 - arxiv.org

Video action recognition is one of the representative tasks for video understanding. Over the
last decade, we have witnessed great advancements in video action recognition thanks to …

被引用次数：203 相关文章所有 2 个版本