Cross-view and Cross-pose Completion for 3D Human Understanding

M Armando, S Galaaoui, F Baradel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Human perception and understanding is a major domain of computer vision which like many
other vision subdomains recently stands to gain from the use of large models pre-trained on …

Hap: Structure-aware masked image modeling for human-centric perception

J Yuan, X Zhang, H Zhou, J Wang… - Advances in …, 2024 - proceedings.neurips.cc
Abstract Model pre-training is essential in human-centric perception. In this paper, we first
introduce masked image modeling (MIM) as a pre-training approach for this task. Upon …

Know your self-supervised learning: A survey on image-based generative and discriminative training

U Ozbulak, HJ Lee, B Boga, ET Anzaku, H Park… - arXiv preprint arXiv …, 2023 - arxiv.org
Although supervised learning has been highly successful in improving the state-of-the-art in
the domain of image-based computer vision in the past, the margin of improvement has …

Unified Human-centric Model, Framework and Benchmark: A Survey

X Zhao, S Sulaiman, WY Leng - IEEE Access, 2024 - ieeexplore.ieee.org
Human-centric Computer Vision Tasks (HCTs) refer to a series of tasks related to the human
body, such as Human Pose Estimation, Pedestrian Tracking, Re-Identification (ReID) …

PoseEmbroider: Towards a 3D, Visual, Semantic-Aware Human Pose Representation

G Delmas, P Weinzaepfel, F Moreno-Noguer… - … on Computer Vision, 2025 - Springer
Aligning multiple modalities in a latent space, such as images and texts, has shown to
produce powerful semantic visual representations, fueling tasks like image captioning, text …

Adept: Annotation-denoising auxiliary tasks with discrete cosine transform map and keypoint for human-centric pretraining

W He, Y Yan, S Tang, Y Deng, Y Zhong, P Luo, D Qi - Neurocomputing, 2025 - Elsevier
Human-centric perception is the core of diverse computer vision tasks and has been a long-
standing research focus. However, previous research studied these human-centric tasks …

Multi Positive Contrastive Learning with Pose-Consistent Generated Images

S Inayoshi, AR Widya, S Ozaki, J Otsuka… - arXiv preprint arXiv …, 2024 - arxiv.org
Model pre-training has become essential in various recognition tasks. Meanwhile, with the
remarkable advancements in image generation models, pre-training methods utilizing …