Attention is all we need: Nailing down object-centric attention for egocentric activity recognition

A review of multimodal human activity recognition with special emphasis on classification, applications, challenges and future directions

SK Yadav, K Tiwari, HM Pandey, SA Akbar - Knowledge-Based Systems, 2021 - Elsevier

Human activity recognition (HAR) is one of the most important and challenging problems in
the computer vision. It has critical application in wide variety of tasks including gaming …

Cited by 252 · Related articles · All 3 versions

[PDF] thecvf.com

Epic-fusion: Audio-visual temporal binding for egocentric action recognition

E Kazakos, A Nagrani, A Zisserman… - Proceedings of the …, 2019 - openaccess.thecvf.com

We focus on multi-modal fusion for egocentric action recognition, and propose a novel
architecture for multi-modal temporal-binding, i.e., the combination of modalities within a …

Cited by 419 · Related articles · All 15 versions

[PDF] aaai.org

Smart frame selection for action recognition

SN Gowda, M Rohrbach, L Sevilla-Lara - Proceedings of the AAAI …, 2021 - ojs.aaai.org

Video classification is computationally expensive. In this paper, we address the problem of
frame selection to reduce the computational cost of video classification. Recent work has …

Cited by 181 · Related articles · All 6 versions

[PDF] thecvf.com

Grouped spatial-temporal aggregation for efficient action recognition

C Luo, AL Yuille - … of the IEEE/CVF international conference …, 2019 - openaccess.thecvf.com

Temporal reasoning is an important aspect of video analysis. 3D CNN shows good
performance by exploring spatial-temporal features jointly in an unconstrained way, but it …

Cited by 204 · Related articles · All 7 versions

[PDF] thecvf.com

What would you expect? anticipating egocentric actions with rolling-unrolling lstms and modality attention

A Furnari, GM Farinella - Proceedings of the IEEE/CVF …, 2019 - openaccess.thecvf.com

Egocentric action anticipation consists in understanding which objects the camera wearer
will interact with in the near future and which actions they will perform. We tackle the …

Cited by 216 · Related articles · All 7 versions

[PDF] thecvf.com

Interactive prototype learning for egocentric action recognition

X Wang, L Zhu, H Wang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Egocentric video recognition is a challenging task that requires to identify both the actor's
motion and the active object that the actor interacts with. Recognizing the active object is …

Cited by 74 · Related articles · All 6 versions

[PDF] arxiv.org

Rolling-unrolling lstms for action anticipation from first-person video

A Furnari, GM Farinella - IEEE transactions on pattern analysis …, 2020 - ieeexplore.ieee.org

In this paper, we tackle the problem of egocentric action anticipation, i.e., predicting what
actions the camera wearer will perform in the near future and which objects they will interact …

Cited by 172 · Related articles · All 7 versions

[PDF] thecvf.com

Lsta: Long short-term attention for egocentric action recognition

S Sudhakaran, S Escalera… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

Egocentric activity recognition is one of the most challenging tasks in video analysis. It
requires a fine-grained discrimination of small objects and their manipulation. While some …

Cited by 196 · Related articles · All 18 versions

[PDF] arxiv.org

Trear: Transformer-based rgb-d egocentric action recognition

X Li, Y Hou, P Wang, Z Gao, M Xu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

In this article, we propose a transformer-based RGB-D egocentric action recognition
framework, called Trear. It consists of two modules: 1) interframe attention encoder and 2) …

Cited by 105 · Related articles · All 7 versions

[PDF] springer.com

An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer

What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Cited by 32 · Related articles · All 7 versions