Deep 3D human pose estimation: A review

J Wang, S Tan, X Zhen, S Xu, F Zheng, Z He… - Computer Vision and …, 2021 - Elsevier
Three-dimensional (3D) human pose estimation involves estimating the articulated
3D joint locations of a human body from an image or video. Due to its widespread …

Recovering 3D human mesh from monocular images: A survey

Y Tian, H Zhang, Y Liu, L Wang - IEEE transactions on pattern …, 2023 - ieeexplore.ieee.org
Estimating human pose and shape from monocular images is a long-standing problem in
computer vision. Since the release of statistical body models, 3D human mesh recovery has …

Humans in 4D: Reconstructing and tracking humans with transformers

S Goel, G Pavlakos, J Rajasegaran… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present an approach to reconstruct humans and track them over time. At the core of our
approach, we propose a fully "transformerized" version of a network for human mesh …

BEDLAM: A synthetic dataset of bodies exhibiting detailed lifelike animated motion

MJ Black, P Patel, J Tesch… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We show, for the first time, that neural networks trained only on synthetic data achieve
state-of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …

End-to-end human pose and mesh reconstruction with transformers

K Lin, L Wang, Z Liu - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
We present a new method, called MEsh TRansfOrmer (METRO), to reconstruct 3D human
pose and mesh vertices from a single image. Our method uses a transformer encoder to …

PyMAF: 3D human pose and shape regression with pyramidal mesh alignment feedback loop

H Zhang, Y Tian, X Zhou, W Ouyang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Regression-based methods have recently shown promising results in reconstructing human
meshes from monocular images. By directly mapping raw pixels to model parameters, these …

BANMo: Building animatable 3D neural models from many casual videos

G Yang, M Vo, N Neverova… - Proceedings of the …, 2022 - openaccess.thecvf.com
Prior work for articulated 3D shape reconstruction often relies on specialized multi-view and
depth sensors or pre-built deformable 3D models. Such methods do not scale to diverse sets …

PyMAF-X: Towards well-aligned full-body model regression from monocular images

H Zhang, Y Tian, Y Zhang, M Li, L An… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
We present PyMAF-X, a regression-based approach to recovering a parametric full-body
model from a single image. This task is very challenging since minor parametric deviation …

I2L-MeshNet: Image-to-lixel prediction network for accurate 3D human pose and mesh estimation from a single RGB image

G Moon, KM Lee - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Most of the previous image-based 3D human pose and mesh estimation methods estimate
parameters of the human mesh model from an input image. However, directly regressing the …

PIFuHD: Multi-level pixel-aligned implicit function for high-resolution 3D human digitization

S Saito, T Simon, J Saragih… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Recent advances in image-based 3D human shape estimation have been driven by the
significant improvement in representation power afforded by deep neural networks …