From Method to Application: A Review of Deep 3D Human Motion Capture

Z Niu, K Lu, J Xue, X Qin, J Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Motion capture technology is crucial in various applications like animation, virtual reality and
sports analysis. With the development of deep learning methods, significant progress has …

PhysPT: Physics-aware Pretrained Transformer for Estimating Human Dynamics from Monocular Videos

Y Zhang, JO Kephart, Z Cui, Q Ji - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
While current methods have shown promising progress on estimating 3D human motion
from monocular videos their motion estimates are often physically unrealistic because they …

Deep Patch Visual SLAM

L Lipson, Z Teed, J Deng - arXiv preprint arXiv:2408.01654, 2024 - arxiv.org
Recent work in visual SLAM has shown the effectiveness of using deep network backbones.
Despite excellent accuracy, however, such approaches are often expensive to run or do not …

ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning

J Lin, Y Feng, W Liu, MJ Black - arXiv preprint arXiv:2405.04533, 2024 - arxiv.org
Numerous methods have been proposed to detect, estimate, and analyze properties of
people in images, including the estimation of 3D pose, shape, contact, human-object …

HumanPlus: Humanoid Shadowing and Imitation from Humans

Z Fu, Q Zhao, Q Wu, G Wetzstein, C Finn - arXiv preprint arXiv:2406.10454, 2024 - arxiv.org
One of the key arguments for building robots that have similar form factors to human beings
is that we can leverage the massive human data for training. Yet, doing so has remained …

Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation

I Sárándi, G Pons-Moll - arXiv preprint arXiv:2407.07532, 2024 - arxiv.org
With the explosive growth of available training data, single-image 3D human modeling is
ahead of a transition to a data-centric paradigm. A key to successfully exploiting data scale …

WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild

RA Potamias, J Zhang, J Deng, S Zafeiriou - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, 3D hand pose estimation methods have garnered significant attention due to
their extensive applications in human-computer interaction, virtual reality, and robotics. In …

COIN: Control-Inpainting Diffusion Prior for Human and Camera Motion Estimation

J Li, Y Yuan, D Rempe, H Zhang, P Molchanov… - arXiv preprint arXiv …, 2024 - arxiv.org
Estimating global human motion from moving cameras is challenging due to the
entanglement of human and camera motions. To mitigate the ambiguity, existing methods …

World-Grounded Human Motion Recovery via Gravity-View Coordinates

Z Shen, H Pi, Y Xia, Z Cen, S Peng, Z Hu, H Bao… - arXiv preprint arXiv …, 2024 - arxiv.org
We present a novel method for recovering world-grounded human motion from monocular
video. The main challenge lies in the ambiguity of defining the world coordinate system …

MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models

L Uzolas, E Eisemann, P Kellnhofer - arXiv preprint arXiv:2405.20155, 2024 - arxiv.org
Animation techniques bring digital 3D worlds and characters to life. However, manual
animation is tedious and automated techniques are often specialized to narrow shape …