Searching multi-rate and multi-modal temporal enhanced networks for gesture recognition
Gesture recognition has attracted considerable attention owing to its great potential in
applications. Although the great progress has been made recently in multi-modal learning …
applications. Although the great progress has been made recently in multi-modal learning …
Decoupling and recoupling spatiotemporal representation for rgb-d-based motion recognition
Decoupling spatiotemporal representation refers to decomposing the spatial and temporal
features into dimension-independent factors. Although previous RGB-D-based motion …
features into dimension-independent factors. Although previous RGB-D-based motion …
Highly-optimized radar-based gesture recognition system with depthwise expansion module
The increasing integration of technology in our daily lives demands the development of
more convenient human–computer interaction (HCI) methods. Most of the current hand …
more convenient human–computer interaction (HCI) methods. Most of the current hand …
Event-based diffractive neural network chip for dynamic action recognition
Dynamic action recognition has promising applications in human–computer interaction,
information encryption, and high-speed image processing. However, it is challenging for a …
information encryption, and high-speed image processing. However, it is challenging for a …
Attention-based hand semantic segmentation and gesture recognition using deep networks
The ability to discern the shape of hands can be a vital issue in improving the performance
of hand gesture recognition for human–computer interaction. Segmentation itself is a very …
of hand gesture recognition for human–computer interaction. Segmentation itself is a very …
Real-time monocular skeleton-based hand gesture recognition using 3D-Jointsformer
Automatic hand gesture recognition in video sequences has widespread applications,
ranging from home automation to sign language interpretation and clinical operations. The …
ranging from home automation to sign language interpretation and clinical operations. The …
Multi-scale attention 3D convolutional network for multimodal gesture recognition
Gesture recognition is an important direction in computer vision research. Information from
the hands is crucial in this task. However, current methods consistently achieve attention on …
the hands is crucial in this task. However, current methods consistently achieve attention on …
A Unified Multimodal De- and Re-Coupling Framework for RGB-D Motion Recognition
Motion recognition is a promising direction in computer vision, but the training of video
classification models is much harder than images due to insufficient data and considerable …
classification models is much harder than images due to insufficient data and considerable …
Multimodal fusion hierarchical self-attention network for dynamic hand gesture recognition
P Balaji, MR Prusty - Journal of Visual Communication and Image …, 2024 - Elsevier
Recent improvements in dynamic hand gesture recognition have seen a shift from traditional
convolutional architectures to attention-based networks. These attention networks have …
convolutional architectures to attention-based networks. These attention networks have …
GestFormer: Multiscale Wavelet Pooling Transformer Network for Dynamic Hand Gesture Recognition
M Garg, D Ghosh, PM Pradhan - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Transformer models have achieved state-of-the-art results in many applications like NLP
classification etc. But their exploration in gesture recognition task is still limited. So we …
classification etc. But their exploration in gesture recognition task is still limited. So we …