Recovering 3d human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

FastViT: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

Pymaf-x: Towards well-aligned full-body model regression from monocular images

H Zhang, Y Tian, Y Zhang, M Li, L An… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
We present PyMAF-X, a regression-based approach to recovering a parametric full-body
model from a single image. This task is very challenging since minor parametric deviation …

Frankmocap: A monocular 3d whole-body pose estimation system via regression and integration

Y Rong, T Shiratori, H Joo - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
Most existing monocular 3D pose estimation approaches only focus on a single body part,
neglecting the fact that the essential nuance of human motion is conveyed through a concert …

Interacting attention graph for single image two-hand reconstruction

M Li, L An, H Zhang, L Wu, F Chen… - Proceedings of the …, 2022 - openaccess.thecvf.com
Graph convolutional network (GCN) has achieved great success in single hand
reconstruction task, while interacting two-hand reconstruction by GCN remains unexplored …

Monocular expressive body regression through body-driven attention

V Choutas, G Pavlakos, T Bolkart, D Tzionas… - Computer Vision–ECCV …, 2020 - Springer
To understand how people look, interact, or perform tasks, we need to quickly and
accurately capture their 3D body, face, and hands together from an RGB image. Most …

Handoccnet: Occlusion-robust 3d hand mesh estimation network

JK Park, Y Oh, G Moon, H Choi… - Proceedings of the …, 2022 - openaccess.thecvf.com
Hands are often severely occluded by objects, which makes 3D hand mesh estimation
challenging. Previous works often have disregarded information at occluded regions …

A dataset of relighted 3D interacting hands

G Moon, S Saito, W Xu, R Joshi… - Advances in …, 2023 - proceedings.neurips.cc
The two-hand interaction is one of the most challenging signals to analyze due to the self-
similarity, complicated articulations, and occlusions of hands. Although several datasets …

Reconstructing hands in 3d with transformers

G Pavlakos, D Shan, I Radosavovic… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present an approach that can reconstruct hands in 3D from monocular input. Our
approach for Hand Mesh Recovery HaMeR follows a fully transformer-based architecture …

Dexmv: Imitation learning for dexterous manipulation from human videos

Y Qin, YH Wu, S Liu, H Jiang, R Yang, Y Fu… - European Conference on …, 2022 - Springer
While significant progress has been made on understanding hand-object interactions in
computer vision, it is still very challenging for robots to perform complex dexterous …