A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions

SK Yadav, K Tiwari, HM Pandey, SA Akbar - Knowledge-Based Systems, 2021 - Elsevier
Human activity recognition (HAR) is one of the most important and challenging problems in
the computer vision. It has critical application in wide variety of tasks including gaming …

Epic-fusion: Audio-visual temporal binding for egocentric action recognition

E Kazakos, A Nagrani, A Zisserman… - Proceedings of the …, 2019 - openaccess.thecvf.com
We focus on multi-modal fusion for egocentric action recognition, and propose a novel
architecture for multi-modal temporal-binding, ie the combination of modalities within a …

Smart frame selection for action recognition

SN Gowda, M Rohrbach, L Sevilla-Lara - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Video classification is computationally expensive. In this paper, we address theproblem of
frame selection to reduce the computational cost of video classification. Recent work has …

Grouped spatial-temporal aggregation for efficient action recognition

C Luo, AL Yuille - … of the IEEE/CVF international conference …, 2019 - openaccess.thecvf.com
Temporal reasoning is an important aspect of video analysis. 3D CNN shows good
performance by exploring spatial-temporal features jointly in an unconstrained way, but it …

What would you expect? anticipating egocentric actions with rolling-unrolling lstms and modality attention

A Furnari, GM Farinella - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com
Egocentric action anticipation consists in understanding which objects the camera wearer
will interact with in the near future and which actions they will perform. We tackle the …

Interactive prototype learning for egocentric action recognition

X Wang, L Zhu, H Wang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Egocentric video recognition is a challenging task that requires to identify both the actor's
motion and the active object that the actor interacts with. Recognizing the active object is …

Rolling-unrolling lstms for action anticipation from first-person video

A Furnari, GM Farinella - IEEE transactions on pattern analysis …, 2020 - ieeexplore.ieee.org
In this paper, we tackle the problem of egocentric action anticipation, ie, predicting what
actions the camera wearer will perform in the near future and which objects they will interact …

Lsta: Long short-term attention for egocentric action recognition

S Sudhakaran, S Escalera… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Egocentric activity recognition is one of the most challenging tasks in video analysis. It
requires a fine-grained discrimination of small objects and their manipulation. While some …

Trear: Transformer-based rgb-d egocentric action recognition

X Li, Y Hou, P Wang, Z Gao, M Xu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, we propose a transformer-based RGB-D egocentric action recognition
framework, called Trear. It consists of two modules: 1) interframe attention encoder and 2) …

An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …