Human image generation: A comprehensive survey

Z Jia, Z Zhang, L Wang, T Tan - ACM Computing Surveys, 2024 - dl.acm.org
Image and video synthesis has become a blooming topic in computer vision and machine
learning communities along with the developments of deep generative models, due to its …

VS: Reconstructing Clothed 3D Human from Single Image via Vertex Shift

L Liu, Y Li, Y Gao, C Gao, Y Liu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Various applications require high-fidelity and artifact-free 3D human reconstructions.
However current implicit function-based methods inevitably produce artifacts while existing …

Single-view 3d human digitalization with large reconstruction models

Z Weng, J Liu, H Tan, Z Xu, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce Human-LRM, a single-stage feed-forward Large Reconstruction
Model designed to predict human Neural Radiance Fields (NeRF) from a single image. Our …

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Y Huang, J Wang, A Zeng, ZJ Zha, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Leveraging pretrained 2D diffusion models and score distillation sampling (SDS), recent
methods have shown promising results for text-to-3D avatar generation. However …

DiffusionRegPose: Enhancing Multi-Person Pose Estimation using a Diffusion-Based End-to-End Regression Approach

D Tan, H Chen, W Tian, L Xiong - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
This paper presents the DiffusionRegPose a novel approach to multi-person pose
estimation that converts a one-stage end-to-end keypoint regression model into a diffusion …

Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey

S Yang, X Gu, Z Kuang, F Qin, Z Wu - The Visual Computer, 2024 - Springer
The reconstruction of high-quality 3D clothed humans from monocular images or videos has
gained popularity in recent years due to its significant practical applications. While several …

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

J Shin, J Lee, S Lee, MG Park, JM Kang… - … on Computer Vision, 2025 - Springer
We present a novel framework for reconstructing animatable human avatars from multiple
images, termed CanonicalFusion. Our central concept involves integrating individual …

GaussianAvatar: Human avatar Gaussian splatting from monocular videos

H Lin, Y Zhan - Computers & Graphics, 2024 - Elsevier
Many application fields including virtual reality and movie production demand reconstructing
high-quality digital human avatars from monocular videos and real-time rendering. However …

[HTML][HTML] Enhanced Multi-Scale Attention-Driven 3D Human Reconstruction from Single Image

Y Ren, M Zhou, P Zhou, S Wang, Y Liu, G Geng, K Li… - Electronics, 2024 - mdpi.com
Due to the inherent limitations of a single viewpoint, reconstructing 3D human meshes from
a single image has long been a challenging task. While deep learning networks enable us …

Multi-hypotheses Conditioned Point Cloud Diffusion for 3D Human Reconstruction from Occluded Images

D Kim, TK Kim - arXiv preprint arXiv:2409.18364, 2024 - arxiv.org
3D human shape reconstruction under severe occlusion due to human-object or human-
human interaction is a challenging problem. Parametric models ie, SMPL (-X), which are …