Weakly-supervised mesh-convolutional hand reconstruction in the wild

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org

Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

被引用次数：110 相关文章所有 8 个版本

[PDF] thecvf.com

FastViT: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

被引用次数：76 相关文章所有 5 个版本

[PDF] arxiv.org

Pymaf-x: Towards well-aligned full-body model regression from monocular images

H Zhang, Y Tian, Y Zhang, M Li, L An… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

We present PyMAF-X, a regression-based approach to recovering a parametric full-body
model from a single image. This task is very challenging since minor parametric deviation …

被引用次数：106 相关文章所有 7 个版本

[PDF] thecvf.com

Frankmocap: A monocular 3d whole-body pose estimation system via regression and integration

Y Rong, T Shiratori, H Joo - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Most existing monocular 3D pose estimation approaches only focus on a single body part,
neglecting the fact that the essential nuance of human motion is conveyed through a concert …

被引用次数：179 相关文章所有 8 个版本

[PDF] thecvf.com

Interacting attention graph for single image two-hand reconstruction

M Li, L An, H Zhang, L Wu, F Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …

被引用次数：93 相关文章所有 5 个版本

Monocular expressive body regression through body-driven attention

V Choutas, G Pavlakos, T Bolkart, D Tzionas… - Computer Vision–ECCV …, 2020 - Springer

To understand how people look, interact, or perform tasks, we need to quickly and
accurately capture their 3D body, face, and hands together from an RGB image. Most …

被引用次数：232 相关文章所有 5 个版本

[PDF] thecvf.com

Handoccnet: Occlusion-robust 3d hand mesh estimation network

JK Park, Y Oh, G Moon, H Choi… - Proceedings of the …, 2022 - openaccess.thecvf.com

Hands are often severely occluded by objects, which makes 3D hand mesh estimation
challenging. Previous works often have disregarded information at occluded regions …

被引用次数：85 相关文章所有 5 个版本

[PDF] thecvf.com

Reconstructing hands in 3d with transformers

G Pavlakos, D Shan, I Radosavovic… - Proceedings of the …, 2024 - openaccess.thecvf.com

We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …

被引用次数：16 相关文章所有 3 个版本

[PDF] arxiv.org

Dexmv: Imitation learning for dexterous manipulation from human videos

Y Qin, YH Wu, S Liu, H Jiang, R Yang, Y Fu… - European Conference on …, 2022 - Springer

While significant progress has been made on understanding hand-object interactions in
computer vision, it is still very challenging for robots to perform complex dexterous …

被引用次数：118 相关文章所有 7 个版本

[PDF] thecvf.com

Semi-supervised 3d hand-object poses estimation with interactions in time

S Liu, H Jiang, J Xu, S Liu… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

Estimating 3D hand and object pose from a single image is an extremely challenging
problem: hands and objects are often self-occluded during interactions, and the 3D …

被引用次数：140 相关文章所有 7 个版本