Liftedcl: Lifting contrastive learning for human-centric perception

M Armando, S Galaaoui, F Baradel… - Proceedings of the …, 2024 - openaccess.thecvf.com

Human perception and understanding is a major domain of computer vision which like many
other vision subdomains recently stands to gain from the use of large models pre-trained on …

被引用次数：3 相关文章所有 3 个版本

[PDF] neurips.cc

Hap: Structure-aware masked image modeling for human-centric perception

J Yuan, X Zhang, H Zhou, J Wang… - Advances in …, 2024 - proceedings.neurips.cc

Abstract Model pre-training is essential in human-centric perception. In this paper, we first
introduce masked image modeling (MIM) as a pre-training approach for this task. Upon …

被引用次数：9 相关文章所有 5 个版本

[PDF] arxiv.org

Know your self-supervised learning: A survey on image-based generative and discriminative training

U Ozbulak, HJ Lee, B Boga, ET Anzaku, H Park… - arXiv preprint arXiv …, 2023 - arxiv.org

Although supervised learning has been highly successful in improving the state-of-the-art in
the domain of image-based computer vision in the past, the margin of improvement has …

被引用次数：33 相关文章所有 7 个版本

[PDF] ieee.org

Unified Human-centric Model, Framework and Benchmark: A Survey

X Zhao, S Sulaiman, WY Leng - IEEE Access, 2024 - ieeexplore.ieee.org

Human-centric Computer Vision Tasks (HCTs) refer to a series of tasks related to the human
body, such as Human Pose Estimation, Pedestrian Tracking, Re-Identification (ReID) …

PoseEmbroider: Towards a 3D, Visual, Semantic-Aware Human Pose Representation

G Delmas, P Weinzaepfel, F Moreno-Noguer… - … on Computer Vision, 2025 - Springer

Aligning multiple modalities in a latent space, such as images and texts, has shown to
produce powerful semantic visual representations, fueling tasks like image captioning, text …

Adept: Annotation-denoising auxiliary tasks with discrete cosine transform map and keypoint for human-centric pretraining

W He, Y Yan, S Tang, Y Deng, Y Zhong, P Luo, D Qi - Neurocomputing, 2025 - Elsevier

Human-centric perception is the core of diverse computer vision tasks and has been a long-
standing research focus. However, previous research studied these human-centric tasks …

[PDF] arxiv.org

Multi Positive Contrastive Learning with Pose-Consistent Generated Images

S Inayoshi, AR Widya, S Ozaki, J Otsuka… - arXiv preprint arXiv …, 2024 - arxiv.org

Model pre-training has become essential in various recognition tasks. Meanwhile, with the
remarkable advancements in image generation models, pre-training methods utilizing …