Cross-view and Cross-pose Completion for 3D Human Understanding
M Armando, S Galaaoui, F Baradel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Human perception and understanding is a major domain of computer vision which like many
other vision subdomains recently stands to gain from the use of large models pre-trained on …
Hap: Structure-aware masked image modeling for human-centric perception
Model pre-training is essential in human-centric perception. In this paper, we first
introduce masked image modeling (MIM) as a pre-training approach for this task. Upon …
Know your self-supervised learning: A survey on image-based generative and discriminative training
Although supervised learning has been highly successful in improving the state-of-the-art in
the domain of image-based computer vision in the past, the margin of improvement has …
Unified Human-centric Model, Framework and Benchmark: A Survey
Human-centric Computer Vision Tasks (HCTs) refer to a series of tasks related to the human
body, such as Human Pose Estimation, Pedestrian Tracking, Re-Identification (ReID) …
PoseEmbroider: Towards a 3D, Visual, Semantic-Aware Human Pose Representation
Aligning multiple modalities in a latent space, such as images and texts, has shown to
produce powerful semantic visual representations, fueling tasks like image captioning, text …
Adept: Annotation-denoising auxiliary tasks with discrete cosine transform map and keypoint for human-centric pretraining
Human-centric perception is the core of diverse computer vision tasks and has been a long-
standing research focus. However, previous research studied these human-centric tasks …
Multi Positive Contrastive Learning with Pose-Consistent Generated Images
S Inayoshi, AR Widya, S Ozaki, J Otsuka… - arXiv preprint arXiv …, 2024 - arxiv.org
Model pre-training has become essential in various recognition tasks. Meanwhile, with the
remarkable advancements in image generation models, pre-training methods utilizing …