Human action recognition from various data modalities: A review
Human Action Recognition (HAR) aims to understand human behavior and assign a label to
each action. It has a wide range of applications, and therefore has been attracting increasing …
each action. It has a wide range of applications, and therefore has been attracting increasing …
Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
Revisiting skeleton-based action recognition
Human skeleton, as a compact representation of human action, has received increasing
attention in recent years. Many skeleton-based action recognition methods adopt GCNs to …
attention in recent years. Many skeleton-based action recognition methods adopt GCNs to …
Star-transformer: a spatio-temporal cross attention transformer for human action recognition
In action recognition, although the combination of spatio-temporal videos and skeleton
features can improve the recognition performance, a separate model and balancing feature …
features can improve the recognition performance, a separate model and balancing feature …
Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition
SY Boulahia, A Amamra, MR Madi, S Daikh - Machine Vision and …, 2021 - Springer
Multimodal action recognition techniques combine several image modalities (RGB, Depth,
Skeleton, and InfraRed) for a more robust recognition. According to the fusion level in the …
Skeleton, and InfraRed) for a more robust recognition. According to the fusion level in the …
Mmnet: A model-based multimodal network for human action recognition in rgb-d videos
Human action recognition (HAR) in RGB-D videos has been widely investigated since the
release of affordable depth sensors. Currently, unimodal approaches (eg, skeleton-based …
release of affordable depth sensors. Currently, unimodal approaches (eg, skeleton-based …
Toward proactive human–robot collaborative assembly: A multimodal transfer-learning-enabled action prediction approach
Human–robot collaborative assembly (HRCA) is vital for achieving high-level flexible
automation for mass personalization in today's smart factories. However, existing works in …
automation for mass personalization in today's smart factories. However, existing works in …
Multi-view action recognition using contrastive learning
In this work, we present a method for RGB-based action recognition using multi-view videos.
We present a supervised contrastive learning framework to learn a feature embedding …
We present a supervised contrastive learning framework to learn a feature embedding …
Multi-modal 3d human pose estimation with 2d weak supervision in autonomous driving
Abstract 3D human pose estimation (3D HPE) in autonomous vehicles (AV) differs from
other use cases in many factors, including the 3D resolution and range of data, absence of …
other use cases in many factors, including the 3D resolution and range of data, absence of …