Deep learning-based human pose estimation: A survey
Human pose estimation aims to locate the human body parts and build human body
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
representation (eg, body skeleton) from input data such as images and videos. It has drawn …
[HTML][HTML] Deep 3D human pose estimation: A review
Abstract Three-dimensional (3D) human pose estimation involves estimating the articulated
3D joint locations of a human body from an image or video. Due to its widespread …
3D joint locations of a human body from an image or video. Due to its widespread …
Synthetic data from diffusion models improves imagenet classification
Deep generative models are becoming increasingly powerful, now generating diverse high
fidelity photo-realistic samples given text prompts. Have they reached the point where …
fidelity photo-realistic samples given text prompts. Have they reached the point where …
Avatarclip: Zero-shot text-driven generation and animation of 3d avatars
3D avatar creation plays a crucial role in the digital age. However, the whole production
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
process is prohibitively time-consuming and labor-intensive. To democratize this technology …
Stablerep: Synthetic images from text-to-image models make strong visual representation learners
We investigate the potential of learning visual representations using synthetic images
generated by text-to-image models. This is a natural question in the light of the excellent …
generated by text-to-image models. This is a natural question in the light of the excellent …
Fake it till you make it: Learning transferable representations from synthetic imagenet clones
Recent image generation models such as Stable Diffusion have exhibited an impressive
ability to generate fairly realistic images starting from a simple text prompt. Could such …
ability to generate fairly realistic images starting from a simple text prompt. Could such …
Bedlam: A synthetic dataset of bodies exhibiting detailed lifelike animated motion
We show, for the first time, that neural networks trained only on synthetic data achieve state-
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
of-the-art accuracy on the problem of 3D human pose and shape (HPS) estimation from real …
Generative neural articulated radiance fields
Unsupervised learning of 3D-aware generative adversarial networks (GANs) using only
collections of single-view 2D photographs has very recently made much progress. These 3D …
collections of single-view 2D photographs has very recently made much progress. These 3D …
Fake it till you make it: face analysis in the wild using synthetic data alone
We demonstrate that it is possible to perform face-related computer vision in the wild using
synthetic data alone. The community has long enjoyed the benefits of synthesizing training …
synthetic data alone. The community has long enjoyed the benefits of synthesizing training …
Humor: 3d human motion model for robust pose estimation
We introduce HuMoR: a 3D Human Motion Model for Robust Estimation of temporal pose
and shape. Though substantial progress has been made in estimating 3D human motion …
and shape. Though substantial progress has been made in estimating 3D human motion …