Interacting attention graph for single image two-hand reconstruction

M Li, L An, H Zhang, L Wu, F Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …

MEgATrack: monochrome egocentric articulated hand-tracking for virtual reality

S Han, B Liu, R Cabezas, CD Twigg, P Zhang… - ACM Transactions on …, 2020 - dl.acm.org
We present a system for real-time hand-tracking to drive virtual and augmented reality
(VR/AR) experiences. Using four fisheye monochrome cameras, our system generates …

Stereonet: Guided hierarchical refinement for real-time edge-aware depth prediction

S Khamis, S Fanello, C Rhemann… - Proceedings of the …, 2018 - openaccess.thecvf.com
This paper presents StereoNet, the first end-to-end deep architecture for real-time stereo
matching that runs at 60 fps on an NVidia Titan X, producing high-quality, edge-preserved …

Monocular total capture: Posing face, body, and hands in the wild

D Xiang, H Joo, Y Sheikh - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
We present the first method to capture the 3D total motion of a target person from a
monocular view input. Given an image or a monocular video, our method reconstructs the …

Resolving 3D human pose ambiguities with 3D scene constraints

M Hassan, V Choutas, D Tzionas… - Proceedings of the …, 2019 - openaccess.thecvf.com
To understand and analyze human behavior, we need to capture humans moving in, and
interacting with, the world. Most existing methods perform 3D human pose estimation …

Nasa neural articulated shape approximation

B Deng, JP Lewis, T Jeruzalski, G Pons-Moll… - Computer Vision–ECCV …, 2020 - Springer
Efficient representation of articulated objects such as human bodies is an important problem
in computer vision and graphics. To efficiently simulate deformation, existing approaches …

Keypoint transformer: Solving joint identification in challenging hands and object interactions for accurate 3d pose estimation

S Hampali, SD Sarkar, M Rad… - Proceedings of the …, 2022 - openaccess.thecvf.com
We propose a robust and accurate method for estimating the 3D poses of two hands in close
interaction from a single color image. This is a very challenging problem, as large occlusions …

Monocular real-time hand shape and motion capture using multi-modal data

Y Zhou, M Habermann, W Xu… - Proceedings of the …, 2020 - openaccess.thecvf.com
We present a novel method for monocular hand shape and pose estimation at
unprecedented runtime performance of 100fps and at state-of-the-art accuracy. This is …

Rgb2hands: real-time tracking of 3d hand interactions from monocular rgb video

J Wang, F Mueller, F Bernard, S Sorli… - ACM Transactions on …, 2020 - dl.acm.org
Tracking and reconstructing the 3D pose and geometry of two hands in interaction is a
challenging problem that has a high relevance for several human-computer interaction …

[HTML][HTML] Audio-visual speech and gesture recognition by sensors of mobile devices

D Ryumin, D Ivanko, E Ryumina - Sensors, 2023 - mdpi.com
Audio-visual speech recognition (AVSR) is one of the most promising solutions for reliable
speech recognition, particularly when audio is corrupted by noise. Additional visual …