Articulated distance fields for ultra-fast tracking of hands interacting

M Li, L An, H Zhang, L Wu, F Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …

被引用次数：90 相关文章所有 5 个版本

MEgATrack: monochrome egocentric articulated hand-tracking for virtual reality

S Han, B Liu, R Cabezas, CD Twigg, P Zhang… - ACM Transactions on …, 2020 - dl.acm.org

We present a system for real-time hand-tracking to drive virtual and augmented reality
(VR/AR) experiences. Using four fisheye monochrome cameras, our system generates …

被引用次数：199 相关文章

[PDF] thecvf.com

Stereonet: Guided hierarchical refinement for real-time edge-aware depth prediction

S Khamis, S Fanello, C Rhemann… - Proceedings of the …, 2018 - openaccess.thecvf.com

This paper presents StereoNet, the first end-to-end deep architecture for real-time stereo
matching that runs at 60 fps on an NVidia Titan X, producing high-quality, edge-preserved …

被引用次数：392 相关文章所有 12 个版本

[PDF] thecvf.com

Monocular total capture: Posing face, body, and hands in the wild

D Xiang, H Joo, Y Sheikh - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com

We present the first method to capture the 3D total motion of a target person from a
monocular view input. Given an image or a monocular video, our method reconstructs the …

被引用次数：358 相关文章所有 10 个版本

[PDF] thecvf.com

Resolving 3D human pose ambiguities with 3D scene constraints

M Hassan, V Choutas, D Tzionas… - Proceedings of the …, 2019 - openaccess.thecvf.com

To understand and analyze human behavior, we need to capture humans moving in, and
interacting with, the world. Most existing methods perform 3D human pose estimation …

被引用次数：269 相关文章所有 9 个版本

[PDF] arxiv.org

Nasa neural articulated shape approximation

B Deng, JP Lewis, T Jeruzalski, G Pons-Moll… - Computer Vision–ECCV …, 2020 - Springer

Efficient representation of articulated objects such as human bodies is an important problem
in computer vision and graphics. To efficiently simulate deformation, existing approaches …

被引用次数：237 相关文章所有 19 个版本

[PDF] thecvf.com

Keypoint transformer: Solving joint identification in challenging hands and object interactions for accurate 3d pose estimation

S Hampali, SD Sarkar, M Rad… - Proceedings of the …, 2022 - openaccess.thecvf.com

We propose a robust and accurate method for estimating the 3D poses of two hands in close
interaction from a single color image. This is a very challenging problem, as large occlusions …

被引用次数：106 相关文章所有 9 个版本

[PDF] thecvf.com

Monocular real-time hand shape and motion capture using multi-modal data

Y Zhou, M Habermann, W Xu… - Proceedings of the …, 2020 - openaccess.thecvf.com

We present a novel method for monocular hand shape and pose estimation at
unprecedented runtime performance of 100fps and at state-of-the-art accuracy. This is …

被引用次数：217 相关文章所有 8 个版本

[PDF] acm.org

Rgb2hands: real-time tracking of 3d hand interactions from monocular rgb video

J Wang, F Mueller, F Bernard, S Sorli… - ACM Transactions on …, 2020 - dl.acm.org

Tracking and reconstructing the 3D pose and geometry of two hands in interaction is a
challenging problem that has a high relevance for several human-computer interaction …

被引用次数：124 相关文章所有 10 个版本

[HTML] mdpi.com

[HTML][HTML] Audio-visual speech and gesture recognition by sensors of mobile devices

D Ryumin, D Ivanko, E Ryumina - Sensors, 2023 - mdpi.com

Audio-visual speech recognition (AVSR) is one of the most promising solutions for reliable
speech recognition, particularly when audio is corrupted by noise. Additional visual …

被引用次数：41 相关文章所有 9 个版本